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Title: US-10-070-532-4 
Perfect score: 1937 
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Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 2002273 seqs, 358729299 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



2002273 



Database 



A_Geneseq_23Sep04 : * 

1: geneseqpl980s : * 

2 : geneseqpl990s : * 

3 : geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8 : geneseqp2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAU00439 

ID AAU00439 standard; protein; 369 AA. 
XX 

AC AAU00439; ' " 
XX 

DT 17-MAY-2001 (first entry) 
XX 

DE Human neuropeptide receptor splice variant 1. 
XX 

KW Human; neuropeptide receptor; neuropeptide Y receptor; obesity; 

KW nervous system disorder; hyperprolif erative disorder; diabetes mellitus; 

KW cardiovascular disorder; autoimmune disorder; infectious disorder; 

KW eating behaviour disorder; narcolepsy; neurological disease; 

KW narcotics addiction; nicotine addiction; alcohol addiction; gene therapy; 

KW protein co-ordinate data. 



XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Region 47. .72 

FT /label- TM1 

FT /note= "Transmembrane region 1" 

FT Region 83. .106 

FT /label= TM2 

FT /note= "Transmembrane region 2" 

FT Region 112. .142 

FT /label= TM3 

FT /note= "Transmembrane region 3" 

FT Region 163. .189 

FT /label= TM4 

FT 1 /note= "Transmembrane region 4" 

FT Region 214. .239 

FT /label= TM5 

FT /note= "Transmembrane region 5" 

FT Region 299. .327 

FT /label= TM6 

FT /note= "Transmembrane region 6" 

FT Region 335. .363 

FT /label= TM7 

FT /note= "Transmembrane region 7" 
XX 

PN WO200117532-A1. 
XX 

PD 15-MAR-2001. 
XX 

PF 07-SEP-2000; 2000WO-US024518 . 
XX 

PR 10-SEP-1999; 99US-00393696 . 



XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 2001-183276/18. 

DR N-PSDB; AAS00492. 
XX 

PT A new nucleic acid encoding a human neuropeptide receptor polypeptide, 



PT useful for preventing, treating or ameliorating obesity, narcolepsy, 

PT neurological disease and addiction to narcotics, nicotine and alcohol. 
XX 

PS Claim 3; Fig 5; 385pp; English. 
XX 

CC The present sequence represents human neuropeptide receptor splice 

CC variant 1. Two splice variants (AAU00439-AAU00440) and a possible mutant 

CC (AAU00442) of a novel human neuropeptide receptor (AAU00438) are 

CC described. The neuropeptide receptor shows sequence homology to the 

CC neuropeptide Y receptor. Polypeptides and polynucleotides of the 

CC neuropeptide receptor are useful for diagnosing, preventing, or treating 

CC a pathological condition in a subject related to the central nervous and 

CC peripheral nervous systems (CNS and PNS) . The polypeptides and 

CC polynucleotides may be used to treat hyperprolif erative, cardiovascular, 

CC autoimmune, nervous system or infectious disorders e.g. cancer, heart 



CC disease, rheumatoid arthritis, Alzheimer f s disease, HIV infection and 

CC diabetes mellitus . In particular they are useful for preventing, treating 

CC or ameliorating a medical condition in a mammal such as obesity/eating 

CC behaviour disorders, narcolepsy, neurological disease, addiction to 

CC narcotics, nicotine and alcohol, chronic pain, acute pain, migraine 

CC headaches and anxiety disorders . The polynucleotides encoding the 

CC neuropeptide receptor can also be used in gene therapy methods for 

CC treating such diseases 

XX 

SQ Sequence 369 AA; 



Query Match 100.0%; Score 1937; DB 4; Length 369; 

Best Local Similarity 100.0%; Pred. No. 1.9e-189; 

Matches 369; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 ME P SAT P GAQMGVP PG S RE P S P VP P D YEDE FLRYLWRD YL YP KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLI AAYVAVFWA 60 

Qy 61 LVGNT LVC LAVWRN H HMRTVTN Y F I VN L S LADVLVT AI C L PAS LLVD I T E S WL FGHALC K 120 

| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCIAWRNHHMRTVTNYFIWLSJ^VLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVKQSSSVLPELANRTRLFSLCHERWAJDDLYPKIYHSCFFI VTTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVTVIQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFIAEVKQMRARRKTAKML 300 

Qy 301 MWL L VFAL C Y L P I S VLN VL K RVFGMF RQ AS D REAVYAC FT F S HW LVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLWLKRVFGMFRQASDREAWACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSGLPWSLL 369 

I I I I I I I I I 
Db 361 LSGLPWSLL 369 



RESULT 2 
AAW06125 

ID AAW06125 standard; protein; 369 AA. 
XX 

AC AAW06125; 
XX 

DT 22-FEB-1997 (first entry) 
XX 

DE Neuropeptide receptor splice variant-1. 
XX 

KW Human; neuropeptide receptor; splice variant; drug screening; 

KW receptor-agonist; receptor-antagonist; anorectic; antitumour; 

KW anticholesterolemic; neuroprotective; anticonvulsant; hypotensive; 



KW sedative; diagnostic; gene therapy. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Domain 47. .72 

FT /note= "Transmembrane region-1" 

FT Domain 83. .106 

FT /note= "Transmembrane region-2" 

FT Domain 112. .142 

FT /note= "Transmembrane region-3" 

FT Domain 163. .189 

FT /note= "Transmembrane region-4" 

FT Domain 214. .239 

FT /note= "Transmembrane region-5" 

FT Domain 299. .327 

FT /note= "Transmembrane region-6" 

FT Domain 335. .363 

FT /note= "Transmembrane region-7" 

XX 

PN W09634877-A1. 
XX 

PD 07-NOV-1996. 
XX 

PF 05-MAY-1995; 95WO-US005616 . 
XX 

PR 05-MAY-1995; 95WO-US005616 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 1996-506094/50. 

DR N-PSDB; AAT42827. 
XX 

PT Human neuro-peptide receptor polypeptide (s ) - used to identify 

PT antagonists and agonists to such polypeptide ( s ) , e.g. in the treatment of 

PT obesity, Alzheimer's disease, epilepsy, etc. 

XX 

PS Disclosure; Page 51-52; 77pp; English. 
XX 

CC The sequence represents human adult hypothalamus neuropeptide receptor 

CC splice variant-1, which retains activity corresponding to the mature 

CC receptor (AAW06124), which is structurally related to the G-protein- 

CC coupled receptor family. The receptor variant contains 7 transmembrane 

CC regions. The receptor may be produced in recombinant form and used -in a 

CC drug screening assay for isolation of receptor-agonists and -antagonists, 

CC which may be used as anorectic, antitumour, anticholesterolemic, 

CC neuroprotective, anticonvulsant, hypotensive or sedative drugs, etc. The 

CC encoding DNA may be used in genetic disease diagnosis or gene therapy. 

CC The receptor itself and its corresponding antibody may also be used in 

CC therapy and diagnosis 
XX 

SQ Sequence 369 AA; 



Query Match 

Best Local Similarity 



98.6%; 
98.6%; 



Score 1909; DB 2; Length 369; 
Pred. No. 1.4e-186; 



Matches 364; Conservative 2; Mismatches 



3; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 1 ME P SAT PGAQMGVP P GS RE P S PVP P D YEDE FLRYLWRD YL YP KQ YEWVX I AAYVAVFVVA 60 

Qy 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSIADVLWAICLPASLLVDITESWLFGHALCK 12 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I M I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI P YLQAVS VS VAVLT LS F I AL D RW YAI CH P L L FK S T ARRARG S I LGI WAVS LAI MVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AViyiECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIvTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKT7VKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVTOWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVTiKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFAL C YL P I S VLN VLKRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSGLPWSLL 369 

I I I I I I I I I 
Db 361 LSGLPWSLL 369 

RESULT 3 
AAW06124 

ID AAW06124 standard; protein; 402 AA. 
XX 

AC AAW06124; 
XX 

DT 22-FEB-1997 (first entry) 
XX 

DE Neuropeptide receptor. 
XX 

KW Human; neuropeptide receptor; drug screening; receptor-agonist; 
KW receptor-antagonist; anorectic; anti tumour; anticholesterolemic; 
KW neuroprotective; anticonvulsant; hypotensive; sedative; diagnostic; 
KW gene therapy. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Domain 47. .72 

FT /note= "Transmembrane region-1" 

FT Domain 83. .106 

FT /note= "Transmembrane region-2" 

FT Domain 112. .142 

FT /note= "Transmembrane region-3" 

FT Domain 163. .189 

FT /note= "Transmembrane region-4" 



FT 
FT 
FT 
FT 
FT 
FT 



Domain 



Domain 



Domain 



214. .239 

/no.te= "Transmembrane region-5" 
299. .327 

/note= "Transmembrane region-6" 
335. .363 

/note= "Transmembrane region-7" 



XX 

PN W09634877-A1. 
XX 

PD 07-NOV-1996. 
XX 

PF 05-MAY-1995; 95WO-US005616 . 
XX 

PR 05-MAY-1995; 95WO-US005616 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 1996-506094/50. 

DR N-PSDB; AAT42826. 
XX 

PT Human neuro-peptide receptor polypeptide ( s ) - used to identify 

PT antagonists and agonists to such polypeptide ( s ) , e.g. in the treatment of 

PT obesity, Alzheimer's disease, epilepsy, etc. 

XX 

PS Claim 1; Page 49-50; 77pp; English. 
XX 

CC The sequence represents a human adult hypothalamus neuropeptide receptor, 

CC structurally related to the G-protein-coupled receptor family. Splice 

CC variants are given in AAW06125-26. The receptor contains 7 transmembrane 

CC regions . The receptor may be produced in recombinant form and used in a 

CC drug screening assay for isolation of receptor-agonists and -antagonists, 

CC which may be used as anorectic, antitumour, anticholesterolemic, 

CC neuroprotective, anticonvulsant, hypotensive or sedative drugs, etc. The 

CC encoding DNA may be used in genetic disease diagnosis or gene therapy. 

CC The receptor itself and its corresponding antibody may also be used in 

CC therapy and diagnosis 
XX 

SQ Sequence 402 AA; 

Query Match 97.3%; Score 1884; DB 2; Length 402; 
Best Local Similarity 98.9%; Pred. No. 5.7e-184; 

Matches 359; Conservative 2; Mismatches 2; Indels 0; Gaps 0 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60 



Db 



1 




Qy 



61 



LVGNTLVCLAWRNHHMRTWNYFIWLSU^VLVTAICLPASLLVDITESWLFGHALCK 12 0 




Db 



61 



LVGNT LVCIAVWRNHHMRT VTN Y FI VNL S LADVLVTAI CL PAS LLVD I T E S WL FGHALC K 120 



Qy 



121 



VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 





Db 



121 



VIPYLQAVSVSVAVXTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 18 0 



Qy 181 AVMQSSSVLPEIJ\NRTRLFSLCHERWADDL 240 

|||: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I M I I M M I I I M I I I I I I I I I I 'I I 
Db 181 AVMECSSVLPEIANRTRLFSVCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

| | M | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAAN P 1 1 YNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MVVLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 360 

Qy 361 LSG 363 

I II 

Db 361 LSG 363 



RESULT 4 
AAU00440 

ID AAU00440 standard; protein; 372 AA. 
XX 

AC AAU00440; 
XX 

DT 18-JUN-2001 (first entry) 
XX 

DE Human neuropeptide receptor splice variant 2. 
XX 

KW Human; neuropeptide receptor; neuropeptide Y receptor; obesity; 

KW nervous system disorder; hyperprolif erative disorder; diabetes mellitus; 

KW cardiovascular disorder; autoimmune disorder; infectious disorder; 

KW eating behaviour disorder; narcolepsy; neurological disease; 

KW narcotics addiction; nicotine addiction; alcohol addiction; gene therapy; 

KW protein co-ordinate data. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Region 47. .72 

FT /label= TMl 

FT /note= "Transmembrane region 1" 

FT Region 83. .106 

FT /label- TM2 

FT /note- "Transmembrane region 2" 

FT Region 112. .142 

FT /label= TM3 

FT /note= "Transmembrane region 3" 

FT Region 163. .189 

FT /label- TM4 

FT /note- "Transmembrane region 4" 

FT Region 214. .239 

FT /label= TM5 

FT /note- "Transmembrane region 5" 

FT Region 299. .327 

FT /label= TM6 

FT /note= "Transmembrane region 6" 

FT Region 335. .363 

FT /label- TM7 



FT /note= "Transmembrane region 7" 
XX 

PN WO200117532-A1. , 
XX 

PD 15-MAR-2001. 
XX, 

PF 07-SEP-2000; 2000WO-US024518 . 
XX 

PR 10-SEP-1999; 99US-00393696. 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 2001-183276/18. 

DR N-PSDB; AAS00493. 
XX 

PT A new nucleic acid encoding a human neuropeptide receptor polypeptide, 

PT useful for preventing, treating or ameliorating obesity, narcolepsy, 

PT neurological disease and addiction to narcotics, nicotine and alcohol. 
XX 

PS Claim 3; Fig 6; 385pp; English. 
XX 

CC The present sequence represents human neuropeptide receptor splice 

CC variant 2. Two splice variants (AAU00439-AAU0044 0) and a possible mutant 

CC (AAU00442) of a novel human neuropeptide receptor (AAU00438) are 

CC described. The neuropeptide receptor shows sequence homology to the 

CC neuropeptide Y receptor. Polypeptides and polynucleotides of the 

CC neuropeptide receptor are useful for diagnosing, preventing, or treating 

CC a pathological condition in a subject related to the central nervous and 

CC peripheral nervous systems (CNS and PNS) . The polypeptides and 

CC polynucleotides may be used to treat hyperprolif erative, cardiovascular, 

CC autoimmune, nervous system or infectious disorders e.g. cancer, heart 

CC disease, rheumatoid arthritis, Alzheimer's disease, HIV infection and 

CC diabetes mellitus . In particular they are useful for preventing, treating 

CC or ameliorating a medical condition in a mammal such as obesity/eating 

CC behaviour disorders, narcolepsy, neurological disease, addiction to 

CC narcotics, nicotine and alcohol, chronic pain, acute pain, migraine 

CC headaches and anxiety disorders. The polynucleotides encoding the 

CC neuropeptide receptor can also be used in gene therapy methods for 

CC treating such diseases 

XX 

SQ Sequence 372 AA; 

Query Match 96.8%; Score 1875; DB 4; Length 372; 
Best Local Similarity 98.6%; Pred. No. 4.3e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0 

Qy 1 ME P SAT PGAQMGVP P G S RE P S PVP P DYE D E FL RYLWRD Y L Y P KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEP SAT PGAQMGVP PGS RE PS PVP P DYED EFLRYLWRDYLYPKQYEWVL I AAYVAVFVVA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLWAICLPASLLVDITESWLFGH7VLCK 12 0 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCIAWRNHHMRTWNYFIWLSI^VLWAICLPASLLVDITESWLFGHALCK 120 



Qy 



121 VI P Y LQAVS VS VAVLT L S F I P L DRW YAI CHPLLFKS T ARRARGS I L G I WAVS LAI MVP QA 180 



Db 



I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I i I I I I I I I I I I I ! I I I I I I 

121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 


181 


Db 


181 


Qy 


241 


Db 


1 A 1 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 



AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 
Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
AVMECSSVLPELANRTRLFSVCDERWADDLYPKI YHSCFFIVTYLAPLGLMAMAYFQI FR 2 4 0 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I | | | | I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AANP 1 1 YNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN SAAN P 1 1 YN F 360 



I I I 



RESULT 5 
AAW80805 

ID AAW80805 standard; protein; 389 AA. 
XX 

AC AAW80805; 
XX 

DT 29-JAN-1999 (first entry) 
XX 

DE Amino acid sequence of HFGAN72Y a G-protein coupled receptor. 
XX 

KW G-protein coupled receptor family; HFGAN72Y; mutation; probe; agonist; 

KW antagonist; activation; inhibition; gene therapy; antibody; 

KW immune response; vaccine; HIV-1; HIV-2; cancer; anorexia; bulimia; 

KW asthma; Parkinson's disease; acute heart failure; hypotension; 

KW hypertension; urinary retention; osteoporosis; angina pectoris; 

KW myocardial infarction; ulcer; allergies; psychotic disorder; 

KW neurological disorder; gene mapping. > 

XX . 

OS Homo sapiens . 
XX 

PN EP875565-A2. 
XX 

PD 04-NOV-1998. 
XX 

PF 27-OCT-1997; 97EP-00308554 . 

XX . 

PR 30-APR-1997; 97US-00846705 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis C; 
XX 

DR WPI; 1998-570286/49. 

DR N-PSDB; AAV68511. 
XX 

PT New G-protein coupled receptor HFGAN72Y polypeptide and polynucleotide - 

PT useful as diagnostic reagents and for prevention and treatment of HIV 



PT infections, cancer, osteoporosis and Parkinson's disease. 
XX 

PS Claim 1; Page 7; 22pp; English. 
XX 

CC This is the amino acid sequence of the G-protein coupled receptor, 

CC HFGAN72Y used in the method of the invention. HFGAN72Y polypeptides and 

CC polynucleotides are useful for diagnosing susceptibility to diseases by 

CC detecting mutations in the HFGAN72Y gene using probes containing the 

CC HFGAN72Y nucleotide sequence, and can diagnose diseases associated with 

CC HFGAN72Y imbalance by determining HFGAN72Y polypeptide or mRNA expression 

CC levels. Agonists/antagonists can be used in treatment to activate/inhibit 

CC HFGAN72Y activity, in addition to direct administration of antisense 

CC sequences to prevent expression, or HFGAN72Y polypeptides to treat 

CC conditions associated with a lack HFGAN72Y protein. Gene therapy may also 

CC be used to affect endogenous HFGAN72Y polypeptide production. HFGAN72Y 

CC antibodies are useful for inducing an immune response to immunise and 

CC prevent diseases, and for isolating HFGAN72Y clones or purifying the 

CC polypeptides by affinity chromatography. HFGAN72Y polypeptides can be 

CC administered directly or as a vaccine to inoculate against diseases. 

CC Diseases diagnosed, prevented or treated include HIV-1 or HIV-2 

CC infections, pain, cancers, anorexia, bulimia, asthma, Parkinson's 

CC disease, acute heart failure, hypotension, hypertension, urinary 

CC retention, osteoporosis, angina pectoris, myocardial infarction, ulcers; 

CC allergies, benign prostatic hypertrophy, and psychotic and neurological 

CC disorders. The HFGAN72Y polypeptide is also useful for mapping the gene 

CC to a chromosome, allowing gene inheritance to be studied through linkage 

CC analysis 

XX 

SQ Sequence 389 AA; 

Query Match 96.8%; Score 1875; DB 2; Length 389; 

Best Local Similarity 98.6%; Pred. No. 4.5e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIJVWRNHHMRTVTNYFIW 120 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCLAWRNHfflyiRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I LG I WAVS LAI MVP Q A 180 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I II I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKST7VRRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVXPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGUyiAMAYFQIFR 240 



Qy 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQP RGRAFLAEVKQMRARRKT AKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFIAEVKQMRARRKTAKML 300. 

Qy 301 MWLLVFAL C YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT F SHW LVYAN S AAN P 1 1 YN F 360 

| | | | | | | | I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYANS AAN P 1 1 YNF 360 



Qy 

Db 



361 LSG 363 
I I I 

361 LSG 363 



RESULT 6 
AAU11187 

ID AAU11187 standard; protein; 389 AA. 
XX 

AC AAU11187; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFGAN72Y. 
XX 

KW Human; G protein-coupled receptor; GPCR; HFGAN72Y; bacterial infection; 

KW fungal infection; protozoan infection; viral infection; 

KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson 1 s disease; 

KW osteoporosis; myocardial infarction; ulcer; asthma; allergy; 

KW angina pectoris; renal disease; depression; schizophrenia; anorexia; 

KW obesity; Kallman's syndrome; hypothalamic disorder; 

KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease; 

KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN US2001025031-A1. 
XX 

PD 27-SEP-2001. 
XX 

PF 06-APR-2001; 2001US-00828538 . 
XX 

PR 08-JUN-1998; 98US-008 8524 P . 

PR 22-JUL-1998; 98US-0093726P . 

PR 08-JUN-1999; 99US-00328014 . 
XX 

PA (ELLI/) ELLIS C E. 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok C, Bodsworth NJ, Halsey W, Horn SV; 
XX 

DR WPI; 2001-624968/72. 
XX 

PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

PT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers. 
XX 

PS Claim 8; Fig 3; 75pp; English. 
XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or 

CC its variant, encoded by the 8 exon sequences given in the specification. 

CC HFGAN72 is a G protein-coupled receptor (GPCR) . HFGAN72 is useful for the 



CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is 

CC administered by providing to the patient DNA * encoding HFGAN72 and 

CC expressing HFGAN72 in vivo (i.e by gene therapy). HFGAN72 is particularly 

CC useful for applications in the detection and treatment of disease, e.g. 

CC infections such as bacterial, fungal, protozoan and viral infections, 

CC particularly infections caused by human immunodeficiency virus (HIV)'-l or 

CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial 

CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease, 

CC depression, schizophrenia, anorexia, obesity, Kallman f s syndrome, 

CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism), 

CC migraine, pain, lung diseases, burns, sleep disorders, jet lag, 

CC Huntington's disease and many other diseases and disorders given in the 

CC specification. The present sequence is the human HFGAN72Y receptor being 

CC the product of a splice variant of HFGAN72 

XX 

SQ Sequence 389 AA; 

Query Match 96.8%; Score 1875; DB 4; Length 389; 

Best Local Similarity 98.6%; Pred. No. 4.5e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP PDYEDEFL RYLWRD Y L Y P KQ YEWVL I AAYVAVFVVA 60 

I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLS1ADVLWAICLPASLLVDITESWLFGHALC 120 

| I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTVTNYFIWLS^ 120 

Qy 121 VI P Y L Q AVS VS VAVLT L S F I P L D RW Y AI CHPLLFKS T ARRARG S I L G I WAVS LAI MVP Q A 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI PYLQAVSVSVAVLTLS FI ALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I II I I I I I I I I I I I I I I I I II 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQP RGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQ I PGTTS7VLVRNWKRPSDQLGDLEQGLSGE PQP RGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVL KRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 7 
ABB08208 

ID ABB08208 standard; protein; 389 AA. 
XX 

AC ABB08208; 
XX 

DT 12-MAR-2002 (first entry) 



DE G-protein coupled receptor (HFGAN72Y) . 
XX 

KW G-protein; receptor; HFGAN72Y; cytostatic; cardiant; analgesic; cancer; 

KW nootropic; tranquillising; neuroprotective; anti-asthmatic; gene therapy; 

KW infection; HIV-1; pain; anorexia; bulimia; Parkinson's disease; ulcer; 

KW cardiac disease; urinary retention; asthma; allergy; psychotic disorder; 

KW benign prostatic hypertrophy; neurological disorder; anxiety; delirium; 

KW schizophrenia; manic depression; dementia; mental retardation; 

KW dyskinesia; Huntington's disease; Tourette 1 s syndrome; HIV-2 . 
XX 

OS Homo sapiens. 
XX 

PN EP1156110-A2. 
XX 

PD 21-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203010 . 
XX 

PR 30-APR-1997; 97US-00846705 . 

PR 27-OCT-1997; 97EP-00308554 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-084320/12. 

DR N-PSDB; ABA96019. 
XX 

PT New polynucleotide encoding a G-protein coupled receptor designated 

PT HFGAN72Y is useful to diagnose and treat associated diseases including 

PT cancer, infection, cardiac disease and psychotic and neurological 

PT disorders. 
XX 

PS Claim 10; Page 7; 22pp; English. 
XX 

CC The sequence represents G-protein coupled receptor HFGAN72Y. The 

CC invention relates to a novel isolated polynucleotide encoding HFGAN72Y 

CC polypeptide. The polypeptide of the invention has cytostatic, cardiant, 

CC analgesic, tranquillising, nootropic, neuroprotective, and anti-asthmatic 

CC activity. The HFGAN72Y has a use in gene therapy. The HFGAN72Y 

CC polynucleotide or an HFGAN72Y polypeptide agonist are used to treat a 

CC subject in need of enhanced HFGAN72Y activity or expression. An HFGAN72Y 

CC antagonist or competitor, or nucleic acid which inhibits HFGAN72Y 

CC expression is used to treat a subject in need of decreased HFGAN72Y 
CC'"-." activity or expression. HFGAN72Y-associated diseases include infections, 

CC particularly by HIV-1 or HIV-2, cancers, anorexia, bulimia, Parkinson's 

CC disease, cardiac diseases, ulcers, urinary retention, asthma, allergies, 

CC benign prostatic hypertrophy, and psychotic and neurological disorders 

CC including anxiety, schizophrenia, manic depression, delirium, dementia, 

CC severe mental retardation and dyskinesias such as Huntington's disease 

CC and Tourette 's syndrome, and pain 
XX 

SQ Sequence 389 AA; 



Query Match 96.8%; Score 1875; DB 5;. Length 389; 

Best Local Similarity 98.6%; Pred. No. 4.5e~183; 



Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 



0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l.l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNT LVC LAVWRNHHMRT VTN Y FI VNL S LADVLVT AI CL PAS LLVD I T E S WL FGHALCK 120 

I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVClAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGaALCK 120 

Qy 121 VI P Y LQ AVS VS VAVLT L S F I P L D RW Y AI CHPLLFKS T ARRARG S I L G I W AVS LAI MVP Q A 180 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I U I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI P YLQAVSVS VAVLT L S F I AL D RW YAI CH P L L FKS T ARRARG S I LG I WAVS LAI MVP QA 180 

Qy 181 AW[QSSSVLPELANRTRLFSLCHERW7yDDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 

Db 181 AVKECSSvXiPELT^RTRLFSVCDERWADDLYPKIYHSCFFIWYIxAPLGLMAMAYFQIFR 24 0 

Qy 241 KLWGRQIPGTTSALVT^WKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFAL C YL P I S VLNVL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YL P I S VLNVL KRVFGMFRQAS D REAVYAC FT FS HWLVYAN SAANP 1 1 YN F 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 8 
AAW80456 

ID AAW80456 standard; protein; 425 AA. 
XX 

AC AAW80456; 
XX 

DT 26-JAN-1999 (first entry) 
XX 

DE G-protein coupled receptor (HFGAN72X) polypeptide. 
XX 

KW G-protein coupled receptor; HFGAN72X; HIV infection; anorexia; cancer; 

KW bulimia; asthma; Parkinson's disease; acute heart failure; 

KW urinary retention; osteoporosis; angina pectoris; myocardial infarction; 

KW benign prostatic hypertrophy; neurological disorder. 

XX . 

OS Homo sapiens. 

XX 

PN EP875566-A2. 
XX 

PD 04-NOV-1998. 
XX 

PF 27-OCT-1997; 97EP-00308563 . 
XX 

PR 30-APR-1997; 97US-00846704 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 



XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 1998-559432/48. 

DR N-PSDB; AAV63468. 
XX 

PT New human G-protein coupled receptor HFGAN72X polypeptide and 

PT polynucleotide - useful as diagnostic reagents and for treating e.g. HIV 

PT infection, cancer and Parkinson's disease. 

XX 

PS Claim 1; Page 7-8; 24pp; English. 
XX 

CC The present sequence represents a G-protein coupled receptor (HFGAN72X) 

CC polypeptide. HFGAN72X polypeptides and polynucleotides are useful for 

CC diagnosing diseases related to over or under expression of HFGAN72X 

CC proteins by identifying mutations in the HFGAN72X gene using HFGAN72X 

CC probes, or determining HFGAN72X protein or mRNA expression levels. 

CC HFGAN72X polypeptides are also useful for screening for compounds which 

CC affect activity of the protein. Diseases that can be treated with 

CC HFGAN72X include HIV infections, pain, anorexia, cancers, bulimia, 

CC asthma, Parkinson's disease, acute heart failure, hypotension, 

CC hypertension, urinary retention, osteoporosis, angina pectoris, 

CC myocardial infarction, ulcers, allergies, benign prostatic hypertrophy, 

CC and psychotic and neurological disorders 

XX 

SQ Sequence 425 AA; 

Query Match 96.8%; Score 1875; DB 2; Length 425; 

Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P GS RE P S P VP P D YE D E FL RYLWRD Y L Y P KQ Y EWVL I AAYVAVFWA 60 

I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I II II I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLI AAYVAVFWA 60 

Qy 61 LVGNT LVC LAW RN HHMRT VTN Y F I VN L S LADVLVT AI C L PAS LL VD I T ESWL FGHAL C K 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVXVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P YLQAVSVSVAVLTLS FI P LDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI PYLQAVSVSVAVLTLSFIALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 180 

Qy 181 AVKQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I 
Db 181 AVT^ECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFI VTYLAPLGLMAMAYFQIFR 240^ 

Qy 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANP 1 1 YNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YL P I SVLNVLKRVFGMFRQAS D REAVYAC FT FS HWLVYANSAAN P 1 1 YN F 360 



Qy 



361 LSG 363 
I I I 



Db 



361 LSG 363 



RESULT 9 
AAU11188 

ID AAU11188 standard; protein; 425 AA. 
XX 

AC ^ AAU11188; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFGAN72X variant. 
XX 

KW Human; G protein-coupled receptor; GPCR; HFGAN72X; bacterial infection; 

KW fungal infection; protozoan infection; viral infection; 

KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson's disease; 

KW osteoporosis; myocardial infarction;, ulcer; asthma; allergy; 

KW angina pectoris; renal disease; depression; schizophrenia; anorexia; 

KW obesity; Kallman's syndrome; hypothalamic disorder; 

KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease; 

KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN US2001025031-A1. 
XX 

PD 27-SEP-2001. 
XX 

PF 06-APR-2001; 2001US-00828538 . 
XX 

PR 08-JUN-1998; 98US-0088524P . 

PR 22-JUL-1998; 98US-0093726P . 

PR 08-JUN-1999; 99US-00328014 . 
XX 

PA (ELLI/) ELLIS C E. 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok C, Bodsworth NJ, Halsey W, Horn SV; 
XX 

DR WPI; 2001-624968/72. 

DR N-PSDB; AAS17464. 
XX 

PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

PT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers. 
XX 

PS Claim 23; Fig 6; 75pp; English. 
XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or 

CC its variant, encoded by the 8 exon sequences given in the specification. 

CC HFGAN72 is a G protein-coupled receptor (GPCR). HFGAN72 is useful for the 

CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is ' 

CC administered by providing to the patient DNA encoding HFGAN72 and 



CC expressing HFGAN72 in vivo (i.e by gene therapy). HFGAN72 is particularly 

CC useful for applications in the detection and treatment of disease, e.g. 

CC infections such as bacterial, fungal, protozoan and viral infections, 

CC particularly infections caused by human immunodeficiency virus (HIV)-l or 

CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial 

CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease, 

CC depression, schizophrenia, anorexia, obesity, Kallman's syndrome, 

CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism), 

CC migraine, pain, lung diseases, burns, sleep disorders, jet lag, 

CC Huntington's disease and many other diseases and disorders given in the 

CC specification. The present sequence is the human HFGAN72X variant, 

CC encoded by an alternative allele of HFGAN72 

XX 

SQ Sequence 425 AA; 



Query Match 96.8%; Score 1875; DB 4; Length 425; 

Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCLAWRNHHMRTWNYFI V14 


120 






1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VI P YLQAVSVS VAVLTLS FI PLDRWYAI CHP LL FKSTARRARGS I LGI WAVS LAIMVPQA 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


VI PYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 


180 


Qy 


181 


AWQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 


240 






III: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7VRRKTAKML 


300 


Qy 


301 


MWLLVFALC YLPI S VLN VLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAAN P 1 1 YNF 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


MWLLVFALC YL P I S VLN VLKRVFGMFRQAS D REAVYAC FT FSHWLVYAN S AAN P II YN F 


360 


Qy 


361 


LSG 363 




Db 


361 


III 

LSG 363 





RESULT 10 
AAU11186 

ID AAU11186 standard; protein; 425 AA. 
XX 

AC AAU11186; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFGAN72X. 



XX 

KW Human; G protein-coupled receptor; GPCR; HFGAN72X; bacterial infection; 

KW fungal infection; protozoan infection; viral infection; 

KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson's disease; 

KW osteoporosis; myocardial infarction; ulcer; asthma; allergy; 

KW angina pectoris; renal disease; depression; schizophrenia; anorexia; 

KW obesity; Kallman's syndrome; hypothalamic disorder; 

KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease; 

KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN US2001025031-A1. 
XX 

PD 27-SEP-2001. 
XX 

PF 06-APR-2001; 2001US-00828538 . 
XX 

PR 08-JUN-1998; 98US-0088524P . 

PR 22-JUL-1998; 98US-0093726P . 

PR 08-JUN-1999; 99US-00328014 . 
XX 

PA (ELLI/) ELLIS C E . 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok C, Bodsworth NJ, Halsey W, Horn SV; 
XX 

DR WPI; 2001-624968/72. 
XX 

PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

PT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers. 
XX ' 

PS Claim 8; Fig 2; 75pp; English. 
XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or 

CC its variant, encoded by the 8 exon sequences given in the specification. 

CC HFGAN72 is a G protein-coupled receptor (GPCR) . HFGAN72 is useful for the 

CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is 

CC administered by providing to the patient DNA encoding HFGAN72 and 

CC expressing HFGAN72 in vivo (i.e by gene therapy). HFGAN72 is particularly 

CC useful for applications in the" detection and treatment of disease, e.g. 

CC infections such as bacterial, fungal, protozoan and viral infections, 

CC particularly infections caused by human immunodeficiency virus (HIV)-l or 

CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial 

CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease, 

CC depression, schizophrenia, anorexia, obesity, Kallman's syndrome, 

CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism), 

CC migraine, pain, lung diseases, burns, sleep disorders, jet lag, 

CC Huntington's disease and many other diseases and disorders given in the 

CC specification. The present sequence is the human HFGAN72X receptor being 

CC the product of a splice variant of HFGAN72 

XX 



SQ Sequence 425 AA; 



Query Match 96.8%; Score 1875; DB 4; Length 425; 

Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 



Qy 


1 


ME P SAT P GAQMGVP P GS RE P S PVP P D YEDE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWA 


ou 




| M | I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 M M 1 1 1 1 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCIAWRNHHMRTVTNYFIW 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 




Db 


61 


LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHAlLCK 


120 


Qy 


121 


VI PYLQAVSVSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 I 1 1 1 1 1 ' 1 [ [ 1 1 1 [ 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 » 1 1 1 1 1 1 1 1 1 1 1 <>■•■■■ > 




VJVJ 


121 


VI PYLQAVSVSVAVLTLS FIALDRWYAI CHPLLFKSTARRARGS ILGIWAVS LAIMVPQA 


180 


Qy 


181 


AVMQSSSVLPELANRTRLFSLCHERWAIDDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 


240 




|||: I I I I 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


AWIECSSVXPELANRTRLFSVCDERWADDLYPKIYHSCFFI VT 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 




I I I I I 1 I 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 


360 




I | M || M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YNF 


360 


Qy 


361 


LSG 363 




Db 


361 


1 1 1 

LSG 363 





RESULT 11 
AAB67079 

ID AAB67079 standard; protein; 425 AA. 
XX 

AC AAB67079; 
XX 

DT 10-APR-2001 (first entry) 
XX 

DE Human HFGAN72 receptor SEQ ID NO: 13. 
XX 

KW Human; mouse; rat; Lig72A; Lig72B; neuropeptide receptor; HFGAN72 ; - • 
KW truncation mutant; ligand; neurodegenerative disorder; pain; 
KW eating disorder; behaviour disorder; mood disorder. 
XX 

OS Homo sapiens. 
XX 

PN WO200100787-A2. 
XX 

PD 04-JAN-2001. 
XX 

PF 22-JUN-2000; 2000WO-US017251 . 
XX 



PR 25-JUN-1999; 99US-014 1156P . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Bingham S, Darker J, Liu W, Martin JD, Parsons AA, Patel SR; 
XX 

DR WPI; 2001-071483/08. 
XX 



PT Polynucleotides encoding Lig 72A polypeptides or their variants , which 

PT are useful in the treatment of a disease or disorder associated with 

PT pain, e.g. enhanced or exaggerated sensitivity to pain, hyperalgesia, 

PT neuropathic pain and back pain. 
XX 

PS Claim 8; Fig 7; lOlpp; English. 
XX 

CC The present invention provides the protein and coding sequences for the 

CC human, mouse and rat HFGAN receptor ligand Lig72A. It also provides 

CC truncated mutant versions. These, and their agonists and antagonists, are 

CC all useful in the treatment of eating, neurodegenerative, behaviour, 



CC , mood, sexual, hormonal and sleep disorders, pain, depression, epilepsy 

CC and acute inflammatory conditions 

XX 

SQ Sequence 425 AA; 

Query Match 96.8%; Score 1875; DB 4; ■ Length 425; 

Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 L VGNT LVC LAVWRNHHMRTVTN Y FI VN L S LAD VLVT AI C L PAS LLVD I T E S WL FGHALC K 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTOTNYFIWLSLADV 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVKQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: 111111111111111:1 I I I II I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 181 AVMECSSVXPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy • 241 KLWGRQIPGTTSALVT^KRPSDQLGDLEQGLSGEPQPR^ 300 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFL7VEVKQMRARRKTAKML 300 

Qy 301 MWL L VFAL C YL P I S VLN VL K RVFGM FRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MVVLLVFALC YL P I SVLNVLKRVFGMFRQ AS D REAVYAC FT FSHWLVYANS AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 12 
AAG78345 

ID AAG78345 standard; protein; 425 AA. 
XX 

AC AAG78345; 
XX 

DT 22-JAN-2002 (first entry) 
XX 

DE Human HFGAN72X G coupled receptor polypeptide. 
XX 

KW Antibacterial; fungicide; virucide; protozoacide ; anti-HIV; analgesic; 

KW cytostatic; nootropic; antiparkinsonian; cardiant; antiulcer; 

KW antiasthmatic; tranquiliser ; neuroleptic; antidepressant; anticonvulsant 

KW osteopathic; HIV infection; pain; cancer; anorexia; bulimia; 

KW Parkinson f s disease; acute heart failure; hypotension; hypertension; 

KW urinary retention; osteoporosis; angina pectoris; myocardial infarction; 

KW ulcers; asthma; allergy; delirium; dementia; 

KW benign prostatic hypertrophy; anxiety; schizophrenia; manic depression; 

KW dyskinesia; G coupled receptor; HFGAN72X; 7 transmembrane receptor. 
XX 

OS Homo sapiens. 
XX 

PN EP1154019-A2. 
XX 

PD 14-NOV-2001. 
XX 

PF 27-OCT-1997; 2 001EP-002 03008 . 
XX 

PR 30-APR-1997; 97US-00846704 . 

PR 27-OCT-1997; 97EP-00308563 .' 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 

XX , 

DR WPI; 2002-012659/02. 

DR N-PSDB; AAI64172. 

XX 

PT Nucleic acid encoding the HFGAN72X receptor, useful for diagnosis and 

PT treatment of e.g. infections, cancer, anorexia, bulimia, Parkinson's 

PT disease, and acute heart failure. 
XX 

PS Claim 11; Page 7-8; 24pp; English. 
XX 

CC The present sequence is that of a 'human HFGAN72X polypeptide encoded by 

CC cDNA shown in AAI64172. The specification describes a newly isolated 

CC polynucleotide encoding a HFGAN72X G coupled receptor polypeptide. The 

CC protein of the invention has antibacterial, fungicide, virucide, 

CC protozoacide, anti-HIV, cardiant, analgesic, cytostatic, nootropic, 

CC antiparkinsonian, antiulcer, antiasthmatic, tranquiliser, neuroleptic, 

CC antidepressant, anticonvulsant and osteopathic activities. HFGAN72X 

CC polynucleotides (PNs) are used to express HFGAN72X in vivo, to treat 

CC diseases requiring increased activity or expression of HFGAN72X; for 

CC recombinant production of HFGAN72X; diagnose diseases by detecting 

CC mutations in genomic sequences and in chromosome identification and 

CC mapping. HFGAN72X polypeptides are used to raise specific antibodies; as 



CC therapeutic agents; to identify HFGAN72X protein-expressing clones; to 

CC purify HFGAN72X proteins; in vaccines. Cells transformed with HFGAN72X 

CC PNs are used to identify (ant ) agonists of HFGAN72X, useful 

CC therapeutically. Nucleic acids that inhibit expression of HFGAN72X and 

CC polypeptides that compete with ligands for binding to HFGAN72X proteins 

CC are also useful therapeutically and diagnostically . HFGAN72X-related 

CC diseases include infections (bacterial,, viral, fungal or protozoal, 

CC particularly HIV-1 or -2); pain; cancer; anorexia; bulimia; Parkinson's 

CC disease; acute heart failure; hypotension; hypertension; urinary 

CC retention; osteoporosis; angina pectoris; myocardial infarction; ulcers; 

CC asthma; allergy; benign prostatic hypertrophy; anxiety; schizophrenia; 

CC manic depression; delirium; dementia; severe mental retardation and 

CC dyskinesias 

XX 

SQ Sequence 425 AA; 



Query Match 96.8%; Score 1875; DB 5; Length 425; 

Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP P D YE D E FL RYLWRD Y L Y P KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIAWRNHHMRTVTNYF 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCl^WRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAI MVPQ A 180 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFI7VLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVKQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVT^ECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDL EQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVT^WKRPSDQLGDLEQGLSGEPQPRGRAFLAEVT<QMRARRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYANSAAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFAL C Y L P I S VLNVLKRVFGM FRQAS D REAVYAC FT FS HW LVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 13 
ABP81941 

ID ABP81941 standard; protein; 425 AA. 
XX 

AC ABP81941; 
XX 

DT 04-MAR-2003 (first entry) 
XX 



DE Human orexin receptor 1 protein SEQ ID NO: 368. 
XX 

KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy; 

KW G protein-coupled receptor modulator; antibody; immune-related disease; 

KW growth-related disease; cell regeneration-related disease; AIDS; cancer; 

KW immunological-related cell proliferative disease; autoimmune disease; 

KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 

KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 

KW graft versus host ' disease ; Parkinson's disease; multiple sclerosis; pain; 

KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 

KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea; 

KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 

KW ulcer. 

XX 

OS Homo sapiens. 
XX 

PN WO200261087-A2. 
XX 

PD 08-AUG-2002. 
XX 

PF 19-DEC-2001; 2001WO-US050107 . 
XX 

PR 19-DEC-2000; 2000US-0257144P . 
XX 

PA (LIFE- ) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 

DR N-PSDB; ABZ42789. 
XX 

PT' New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR) , useful for diagnosing and designing drugs for treating conditions 

PT in which GPCRs are involved, e.g. AIDS, Alzheimer's disease, cancer or 

PT autoimmune diseases. 

XX 

PS Disclosure; Fig 1; 523pp; English. 
XX 

CC The present invention describes antigenic peptides (I) comprising: (a) 

CC any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino 

CC acids. Also described: (1) an assay for the detection of a particular G 

CC protein-coupled receptor (GPCR) or a candidate polypeptide in a sample; 

CC and (2) an isolated antibody having high specificity and high affinity or 

CC avidity for a particular GPCR. (I) can be used as GPCR modulators and in 

CC gene therapy. The antigenic peptides for GPCRs are useful in detecting an 

CC antibody against a particular GPCR, and in the production r ^of specific 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

CC treating immune-related diseases, growth-related diseases, cell 

CC regeneration-related disease, immunological-related cell proliferative 

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease, 

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 

CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, 

CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 



CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, > ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and immunodia gnosis . ABZ42523 to ABZ42869 encode 

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present invention 
XX 

SQ Sequence 425 AA; 



Query Match 96.8%; 
Best Local Similarity 98.6%; 
Matches 358; Conservative 



Score 1875; DB 6; Length 425; 
Pred. No. 5.1e-183; 
2; Mismatches 3; Indels 0; 



Gaps 



0; 



Qy. 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 



MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ME P S AT PGAQMGVP PGS RE P S PVP P D YEDE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWA 60 

LVGNTLVCLAWRNHHMRTWNYFIWLSLA^ 120 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LVGNTLVC^WRNHHMRTWNYFIWLSLADvTjW 120 

VI PYLQAVS VSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 180 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

VI PYLQAVSVSVAVLTLSFIALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 180 

AVMQSSSVLPELT^RTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 
Ml: I I I I I I I I I I I I I I I : I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I 

AVTVIECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 3 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

KLWGRQIPGTTSALVT^WKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300^ 

MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAAN PIIYNF 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAWACFTFSHWLVYANSAANPI I YNF 360 



I I I 



RESULT 14 
ABG75058 

ID ABG75058 standard; protein; 425 AA. 
XX-. 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 



ABG75058; 

12-FEB-2004 (first entry) 
Human orexin receptor 1 protein. 

Energy homeostasis; mouse; metabolism; triglyceride; metabolic disease; 
gene therapy; anorectic; immunomodulator ; antidiabetic; hypotensive; 
cardiant; osteopathic; antilipemic. 

Homo sapiens. 



XX 

PN WO2003075945-A2. 
XX 

PD 18-SEP-2003. 
XX 

PF 14-MAR-2003; 2003WO-EP0027 14 . 
XX 

PR 14-MAR-2002; 2002EP-00005882 . 

PR 15-MAR-2002; 2002EP-00006012 . 

PR 20-MAR-2002; 2002EP-00006271 . 

PR 25-MAR-2002; 2002EP-00006810 . 
XX 

PA (DEVE-) DEVELOGEN ENTWICKLUNGSBIOLOGISCHE FORSCH. 
XX 

PI Eulenberg K, Steuernagel A, Haeder T, Broenner G; 
XX 

DR WPI; 2003-748334/70. 

DR N-PSDB; ACH00818. 
XX 

PT New pharmaceutical composition comprising a nucleic acid molecule 

PT encoding proteins regulating the energy homeostasis and metabolism of 

PT triglycerides useful for detecting or preventing metabolic diseases, e.g. 

PT obesity. 

XX 

PS Claim 3; Fig 7G; 140pp; English. 
XX 

CC The present invention relates to pharmaceutical compositions comprising 

CC the coding sequences shown in ACH00815-ACH00827 , or their encoded 

CC proteins (shown in ABG75054, ABG75056-ABG75067 ) . These are proteins 

CC involved in the metabolism of triglycerides and in energy homeostasis , 

CC and their coding sequences . The composition is useful for the manufacture 

CC of an agent for detecting, verifying, treating, alleviating or preventing 

CC disorders, including metabolic diseases such as obesity and other body- 

CC weight regulation disorders as well as related disorders such as 

CC metabolic syndrome, eating disorder, cachexia, diabetes mellitus, 

CC hypertension, coronary heart disease, hypercholesterolaemia, 

CC dyslipidaemia, osteoarthritis or gallstones, in cells, cell masses, 

CC organs or subjects. The coding sequences can be used in the production of 

CC transgenic animals which under- or over-produce the gene of interest. The 

CC " present sequence is a protein of the invention 

XX 

SQ Sequence 425 AA; 

Query Match 96.8%; Score 1875; DB 7; Length 425; 
Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSIADVLWAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 61 LVGNTLVC^WRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 



121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 
I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



180 



Db 



121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVS LAIMVPQA 180 



Qy 181 AVMQSSSVLPELANRTRLFSLCHERWAI)DLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I II I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIV^ 240 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSAXVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWL LVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN SAAN P 1 1 YN F 360 

Qy 361 LSG 363 

I II v 
Db 361 LSG 363 



RESULT 15 
ADK52564 

ID ADK52564 standard; protein; 425 AA. 
XX 

AC ADK52564; 

XX 

DT 06-MAY-2004 (first entry) 
XX 

DE Hematological disorder associated Gene ID 14393 encoded protein. 
XX 

KW cytostatic; antianemic; antisickling; virucide; hemostatic; nephrotropic; 

KW cytostatic; thrombolytic; antiparasitic; gene therapy; 

KW hematologic disorder; cancer; Sickle Cell Anemia; 

KW Infectious Mononucleosis; Leukemia; Polycythemia Vera; Lymphoma; 

KW Retinoblastoma; Hemophilia; Thrombosis; Herpes; Thalassemia; 

KW transfusion reaction; Erythroblastosis; mechanical trauma; 

KW micro-angiopathic hemolytic anemia; parasite infection. 

XX 

OS Homo sapiens. 
XX 

PN WO2003065871-A2. 
XX 

PD 14-AUG-2003. 
XX 

PF 28-JAN-2003; 2003WO-US002484 . 
XX 

PR 04-FEB-2002; 2002US-G354333P . 

PR 28-FEB-2002; 2002US-0360258P . 

PR 15-MAR-2002; 2002US-0364476P . 

PR 26-APR-2002; 2002US-0375626P . 

PR 06-JUN-2002; 2002US-0386494P . 

PR 24-JUN-2002; 2002US-0390965P . 

PR 28-JUN-2002; 2002US-03924 80P . 

PR 03-JUL-2002; 2002US-0394128P . 

PR 31-JUL-2002; 2002US-0399783P . 

PR 13-AUG-2002; 2002US-0403221P . 

PR 30-AUG-2002; 2002US-04 07045P . 

PR 25-NOV-2002; 2002US-0429048P . 



XX 

PA (MILL-) MILLENNIUM PHARM INC. 
XX 

PI Carroll JM, Healy A, Weich NS, Kelly LM; 
XX 

DR WPI; 2003-731464/69. 

DR N-PSDB; ADK52563. 
XX 

PT Identifying a compound capable of treating a hematologic disorder (e.g. 

PT anemia or leukemia) comprises assaying the ability of the compound to 

PT modulate the expression or activity of e.g. 131,148, 199 or 12303 

PT polypeptide or nucleic acid. 
XX 

PS Disclosure; SEQ ID NO 22; 232pp; English. 
XX 

CC The invention relates to a method of identifying a compound capable of 

CC treating a hematologic disorder comprises assaying the ability of the 

CC compound to modulate 131,148, 199, 12303, 13906, 15513, 17822, 302, 5677, 

CC 194, 14393, 28059, 7366, 12212, 1981, 261, 12416, 270, 1410, 137, 1871, 

CC 13051, 1847, 1849, 15402, 340, 10217, 837, 1761, 8990 or 13249 nucleic 

CC acid expression or polypeptide activity, thus, identifying a compound 

CC capable of treating a hematologic disorder. The methods are useful in 

CC diagnosing, preventing and treating hematological disorders, such as 

CC cancer, Sickle Cell Anemia, Infectious Mononucleosis, Leukemia, 

CC Polycythemia Vera, Lymphoma, Retinoblastoma, Hemophilia, disorders 

CC associated with an increased risk of Thrombosis, Herpes, Thalassemia, 

CC antibody-mediated disorders such as transfusion reactions and 

CC Erythroblastosis, mechanical trauma to red blood cells such as micro- 

CC angiopathic hemolytic anemias, infections by parasites or chemical 

CC injuries. The methods may also be used for identifying compounds that 

CC modulate hematological disorders. This sequence corresponds to the 

CC protein encoded by one of the genes modulated by the compounds. 

XX 

SQ Sequence 425 AA; 

Query Match 96.8%; Score 1875; DB 7; Length 425; 

Best Local Similarity 98.6%; Pred. No. 5.1e-183; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEP SAT PGAQMGVP PGS REP S PVP PD YEDEFLRYLWRDYL YPKQYEWVLI AAYVAVFVVA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ME P S AT P GAQMGVP P GS REP SP VP P DYE DEFLRYLWRDYLYPKQYEWVlj I AAYVAVFVVA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db * 61 L VGN T L VC LAVW RN H HMRT VT N Y F I VN L S LAD VL VT AI C L PAS Lii VD I T E S W L F GHAL C K 120 



Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAI MVPQ A 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VI P YLQAVS VS VAVLT L S F I AL D RW YAI CH P L L FK S TARRARG S I LG I WAVS LAIMVPQA 180 

Qy 181 AVMQSSSVljPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I | I : I I I I I I II I I I I I I I : I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERW7VDDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



QY 



241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 
| | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



D b 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPI SVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI I YNF 360 

I | | I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I II II I I 

D b 301 MWLLVFALCYLP I SVLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYANS AAN P 1 1 YNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



Search completed: October 14, 2004, 10:50:38 
Job time : 90.9271 sees 



GenCore version 
Copyright (c) 1993 - 2004 



5.1.6 

Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



October 14, 2004, 10:43:25 ; Search time 22.4691 Seconds 

(without alignments) 
1089.110 Million cell updates/sec 



Title: US-10-070-532-4 
Perfect score: 1937 

Sequence: 1 ME P SAT P GAQMGVP P G S RE P NSAANPII YNFLSGLPWSLL 369 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 478139 seqs, 66318000 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 



478139 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1: /cgn2_6/ptodata/l/iaa/5A_COMB.pep:* 

2: /cgn2_6/ptodata/l/iaa/5B_COMB.pep:* 

3: /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB . pep : * 

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB . pep : * 

6: /cgn2_6/ptodata/l/iaa/backfilesl.pep: * 

Pred. No. is the ^number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 

Match Length 


DB 


ID 


Description 




1 


1909 


98.6 


369 


4 


US-08-462-509B-4 


Sequence 


4, 


Appli 


2 


1909 


98.6 


369 


5 


PCT-US95-05616-4 


Sequence 


4, 


Appli 


3 


1875 • 


96.8 


377 


5 


PCT-US95-05616-6 


Sequence 


6, 


Appli 


4 


1875 


96.8 


389 


2 


US-08-846-705-2 


Sequence 


2, 


Appli 


5 


1875 


96.8 


389 


4 


US-09-211-823C-23 


Sequence 


23 


, Appl 


6 


1875 


96.8 


402 


4 


US-08-462-509B-2 


Sequence 


2, 


Appli 


7 


1875 


96.8 


402 


5 


PCT-US95-05616-2 


Sequence 


2, 


Appli 


8 


1875 


96.8 


425 


3 


US-08-846-704-2 


Sequence 


2, 


Appli 


9 


1875 


96.8 


425 


4 


US-09-211-823C-22 


Sequence 


22 


, Appl 


10 


1872 


96.6 


372 


4 


US-08-462-509B-6 


Sequence 


6, 


Appli 


11 


1869 


96.5 


402 


3 


US-08-846-704-4 


Sequence 


4, 


Appli 



1 9 


1869 


96 


. 5 


425 


3 


us- 


09- 


479- 


128-2 


Sequence 


2, Appli 


1 1 


1313 . 5 


67 


. 8 


444 


4 


us- 


09- 


426- 


290-2 


Sequence 


2, Appli 


14 


1312 . 5 


67 


. 8 


444 


3 


us- 


09- 


119- 


788-2 


Sequence 


2, Appli 


15 


1279 


66 


. 0 


263 


3 


us- 


08- 


513- 


974B-54 


Sequence 


54 , Appl 


x u 


1279 


66 


. 0 


263 


3 


us- 


08- 


513- 


974B-376 


Sequence 


376, App 


1 7 

X / 


1279 


66 


. 0 


263 


4 


us- 


09- 


461- 


436B-54 


Sequence 


54, Appl 


1 ft 

X o 


483.5 


25 


. 0 


430 


3 


us- 


09- 


255- 


368-8 


Sequence 


8, Appli 


X -7 




25 


. 0 


430 


4 


us- 


09- 


405- 


558-8 


Sequence 


8, Appli 


Z U 


4 ft n s 

ft O U ■ -J 


94 

Z 1 


ft 

. (J 


432 


3 


US- 


09- 


255- 


368-2 


Spnuence 

kJ ^ I^J, V_. 


2, Appli 


9 1 

Z X 


4 ft 0 S 


94 

Z 4 


. 8 


432 


4 


us- 


09- 


405- 


558-2 


Sequence 


2, Appli 


99 
z z 


4 7^ S 

ft / O . J 


94 

Z ft 


4 


4 90 

ft C, \J 


3 


us- 


09- 


255- 


368-6 


Sfttu price 


6, Appli 


9 ^ 
z o 


47^ S 

ft / O • J 


94 
z. i 


4 
» ft 


420 


4 


us- 


09- 


405- 


558-6 


Sequence 


6, Appli 


94 
z ft 


4 67 S 


94 
z. i 


a 1 


417 


4 


us- 


09- 


405- 


558-44 


Sequence 


44, Appl 


9 S 
Z D 


41 ^ 


9 1 
z X 


4 
• ft 


370 


3 


us- 


09- 


172- 


353-2 


Sequence 


2, Appli 


9£ 

Z D 


4 1 S 


9 1 
c X 


t 4 


370 


3 


us- 


09- 


172- 


353-3 


Sequence 


3 , App 1 i 


97 
Z / 


41^ 

l X J 


91 
z X 


4 
• ft 


370 


4 


US- 


09- 


799- 


955-2 


Sequence 


2, Appli 


9 ft 
Z 0 


4 1 5 


91 

Z X 


4 
. ft 


370 


4 


us- 


09- 


799- 


955-3 


Sequence 


3, Appli 


9 Q 
z 


*i X X * D 


9 1 

Z X 


9 


^70 


3 


US- 


09- 


172- 


353-7 


Sequence 


7, Appli 


"30 


411 R 


9 1 

Z X 


9 
. z 


^70 


4 
ft 


KJ O 


09- 


799- 


955-7 


Secruence 


7 Aooli 


^ 1 


411 

ft X X 


9 1 

Z X 


o 
. z 


^70 




u o 




51 3- 
■~> j. -j 


974B-96 


\A U^Xl V V-r 


9 6 Annl 


"5 9 

oz 


411 

fi X X 


9 1 
Z X 


. z 








Oft- 


S1 3- 

>J x o 


_7 1 ft JD O Z. «J 




323 Ado 




4 11 
1 X X 


9 1 
Z X 


9 
• z 


^70 


"3 


US- 


09- 


172- 


353-5 


Spat] pn r*p 


5 Annl i 


^4 
o fi 


411 
ft X X 


9 1 

Z X 


9 


37 0 


3 


US- 


08- 


776- 


971-21 


Sequence 


21, Appl 




411 
ft X X 


9 1 

Z X 


9 
• z 


^70 


o 


KJ O 


Oft- 


776- 


971-104 


Spaupnce 

t^J U^ll^v^ 


104. Add 


o O 


411 

ft X X 


91 

Z X 


9 


^70 


4 
ft 


nq- 

\J o 


0 Q- 




-955-5 


Spciupti ce 


S AddI i 


o / 


411 
fl X X 


91 

Z X 


o 
. z 


^7 0 


ft 




0Q- 


-4 61 - 




Spfiupn rp 


26. AddI 


O o 


411 
4 X X 


9 1 

Z X 


9 
• z 


^70 


4 
ft 


vJ ►_> 


09- 


-576- 


•290-21 


S e fin en ce 


21, AddI 




411 

ft X X 


21 


.2 


^70 


4 
ft 


US- 


09- 


•576- 


■290-104 


Senuence 

vJ v4 v-- ± i 


104, App 


4 o 


4 04 S 


20 


.9 


370 


3 


us- 


08- 


-776- 




Sequence 


140, App 


41 


404.5 


20 


.9 


370 


4 


us- 


■09- 


■576- 


-290-140 


Sequence 


14 0, App 


42 


401.5 


20 


.7 


369 


4 


us- 


■09- 


-170- 


-496D-178 


Sequence 


178, App 


43 


400.5 


20 


.7 


369 


3 


us~ 


-09- 


-172- 


-353-6 


Sequence 


6, Appli 


44 


400.5 


20.7 


369 


4 


us- 


-09- 


•799- 


-955-6 


Sequence 


6, Appli 


45 


400.5 


20 


.7 


369 


4 


us- 


■09- 


•170- 


-496D-26 


Sequence 


26, Appl 



ALIGNMENTS 



RESULT 1 

US-08-462-509B-4 

; Sequence 4, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

; APPLICANT: Soppet, Daniel et al 

; TITLE OF INVENTION: Human Neuropeptide Receptor--' 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS : 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
CITY: Rockiville 
STATE : MD 
COUNTRY: USA 
; ZIP: 20850 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/4 62 , 509B 
FILING DATE: 05-JUN-1995 
; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: WO PCT/US95/05616 

FILING DATE: 05-MAY-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales , Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/ DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 301-309-8504 

; TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 369 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-462-509B-4 



Query Match 98.6%; Score 1909; DB 4; Length 369; 

Best Local Similarity 98.6%; Pred. No. 2.6e-188; 

Matches 364; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 MEP S AT PGAQMGVP PGS REP S PVP PD YEDEFLRYLWRDYL YPKQYEWVLI AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1 MEP SATPGAQMGVP PGS REPS PVP PDYEDEFLRYLWRDYLYPKQYEWVLI AAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVTiWAICLPASLLVDITESWLFGE 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I.I I M I I I I I I I I I I 
Db 121 VI P YLQAVS VS VAVLT L S F I AL D RW YAI CH P LL FKS T ARRARG S I LG I WAVS LAIMVPQA 180 

Qy 181 AVTVIQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 24 0 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVKECSSVTjPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLTVEVKQMRARRKTAKML 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYANS AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFAL C YL P I S VLNVL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 



Qy 361 LSGLPWSLL 369 

I I I I I I I I I 
Db 361 LSGLPWSLL 369 



RESULT 2 

PCT-US95-05616-4 

; Sequence 4, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT: LI, ET AL. 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 
; CITY: ROSELAND 

STATE: NEW JERSEY 
; COUNTRY: USA 

; ZIP: 07068 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 

SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 

FILING DATE: concurrently 

CLASSIFICATION: 
; ATTORNEY/AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 

REFERENCE/ DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 369 BASE PAIRS 

TYPE: AMINO ACID 

STRANDEDNESS: SINGLE 

TOPOLOGY: LINEAR 
; MOLECULE TYPE: cDNA 
PCT-US95-05616-4 

Query Match 98.6%; Score 1909; DB 5; Length 369; 

Best Local Similarity 98.6%; Pred. No. 2.6e-188; 

Matches 364; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I . 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCl^WRNHHMRTWNYFIWLSLAD 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 LVGNTLVCKAWRNHHMRTWNYFIWLSL7VDVXWAICLPASLLVDITESWL 120 

Qy 121 VI P Y LQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAI MVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I 

Db 121 VI P YLQAVS VSVAVLTLS FI ALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 18 0 

Qy 181 AW[QSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYIAPLGLM7^AYFQIFR 240 



Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKT7VKML 300 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7VRRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGM FRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWLLVFALC YL P I SVLNVLKRVFGMFRQ AS D REAVYAC FT FS HWLVYAN SAAN P 1 1 YN F 360 

Qy 361 LSGLPWSLL 369 



361 LSGLPWSLL 369 



RESULT 3 

PCT-US95-05616-6 

; Sequence 6, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT: LI, ET AL. 
; TITLE OF INVENTION : Human Neuropeptide Receptor 
; NUMBER OF SEQUENCES : 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
ADDRESSEE: 'CECCHI, STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
CITY: ROSELAND 
; STATE: NEW JERSEY 

COUNTRY: USA 
ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
; COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 
; FILING DATE: concurrently 

; CLASSIFICATION: 

. ATTORNEY/AGENT INFORMATION: 
; NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 
REFERENCE/DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 
; . V TELEPHONE: 201-994-1700 
; TELEFAX : 201-994-1744 

; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 377 BASE PAIRS 

; TYPE: AMINO ACID 

STRANDEDNESS: SINGLE 
TOPOLOGY: LINEAR 
MOLECULE TYPE: cDNA 
PCT-US95-05616-6 



Query Match 



96.8%; Score 1875; DB 5; Length 377; 



Best Local Similarity 98.6%; Pred. No. 8.6e-185; 
Matches 358; Conservative 2; Mismatches 3; Indels 



0; Gaps 



0; 



Qy 


1 


ME P SAT P GAQMGVP P GS RE P S P VP P D YEDE FLRYLWRD YL YP KQ YEWVL I AAYVAVFWA 


60 




1 1 I 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 M M M 1 1 M 




Db 


1 


MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 


60 


Qy 


61 


LVGNTLVCIAWRNHHMRTVTNYFIWLSIADVLVTAICLPASLLVDITESWLFGHALCK 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 




Db 


61 


LVGNTLVCLAWRNHHMRTWNYFIWLSIJVDVLVTAICLPASLLVDITESWLFGHALCK 


120 


Qy 


121 


VI PYLQAVSVSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


121 


VI PYLQAVSVSVAVLTLS FIALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 


180 


Qy 


181 


AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 




III' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

|||» | | 1 | | 1 1 1 1 1 1 1 1 1 1 * 1 | l | | | 1 1 | 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ • 1 111! 




Db 


181 


AWECSSVLPEIxANRTRLFSVCDERWAI)DLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 


240 


Qy 


241 


KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 


300 




I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 




Db 


241 


KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 


300 


Qy 


301 


MWL LVFALC Y L P I S VLNVL KRVFGMFRQAS D REAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 


360 




| | | | | | I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


301 


MWLLVFALCYLP I S VLNVL KRVFGMFRQAS D REAVYAC FT FSHWLVYANS AAN P 1 1 YNF 


360 


Qy 


361 


LSG 363 




Db 


361 


III 

LSG 363 





RESULT 4 
US-08-846-705-2 

; Sequence 2, Application US/08846705 

; Patent No. 5935814 

; GENERAL INFORMATION: 

; APPLICANT: BERGSMA, DERK J. 

; APPLICANT: ELLIS, CATHERINE E 

TITLE OF INVENTION: NOVEL G- PROTEIN COUPLED 
; NUMBER OF SEQUENCES: ' 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: RATNER & PRESTIA 
; STREET: P.O. BOX 98 0 

CITY: VALLEY FORGE 

STATE: PA 

COUNTRY : USA 

ZIP: 19482 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846, 705 

FILING DATE: 30-APR-1997 

CLASSIFICATION: 435 



; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 
; FILING DATE: 

; ATTORNEY/ AGENT INFORMATION: 
; NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 

REFERENCE/DOCKET NUMBER: GH-70003 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 
; TELEFAX: 610-407-0701 

TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 38 9 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-846-705-2 

Query Match 96.8%; Score 1875; DB 2; Length 389; 

Best Local Similarity 98.6%; Pred. No. 9e-185; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P GS RE P S P VP P D YEDEFLRYLWRD YL YP KQ YEWVIj I AAYVAVFVVA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTWNYFIWLSL 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVC^WRNHHMRTWNYFIWLSI^VLWAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P YLQAVS VSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVSLAIMVPQA 180 

I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI PYLQAVSVSVAVLTLSFIALDRWYAI CHPLLFKSTARRARGS I LGIWAVSLAIMVPQA 180 

Qy 181 AVT^QSSSVLPEIANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I II I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7VRRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVTOWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLP I S VXNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 301 MWLLVFALCYLP I SVLNVLKRVFGMFRQASDREAVYACFT FSHWLVYANSAAN PI I YNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 5 

US-09-211-823C-23 

; Sequence 23, Application US/09211823C 
; Patent No. 6664229 



GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 

TERRETT, JONATHAN ALEXANDER 
UPTON, NEIL 
PIPER, DAVID 
SMITH, MARTIN IAN 
KENNETT, GUY ANTHONY 
PATEL, SARAS WAT I R. 

METHODS OF TREATMENT USING NOVEL LIGANDS 
OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 



APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
TITLE OF INVENTION: 
TITLE OF INVENTION: 



TITLE OF INVENTION: ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/09/211, 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 23 
LENGTH: 389 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-211-823C-23 



Query Match 96.8%; 
Best Local Similarity 98.6%; 
Matches 358; Conservative 



Score 1875; DB 4; 
Pred. No. 9e-185; 
2; Mismatches 3; 



Length 38 9; 



Indels 



0 ; Gaps 



0; 



Qy 

Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWL'lAAYVAVFVVA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 61 LVGNT LVCLAVWRNHHMRTVTN YFI VNL S LADVLVTAI CLPAS LLVDI T E SWL FGHALCK 12 0 

I I I 1 1 1,1 I I 1 1 1 I I I 1 1 I I I 1 1 1 1 I I 1 1 I I I 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 I I I I I I 

Db 61 LVGNTLVCI^WRNHHMRTVTNYFIWLSLADV 120 

Qy 121 VI P YLQAVSVSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS ILGIWAVS LAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI PYLQAVSVSVAVLTLSFIALDRWYAI CHPLLFKSTARRARGS ILGIWAVS LAIMVPQA 180 

Qy 181 AVT^QSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYIAPLGLMAMAYFQIFR 240 

III: I I I I I I i I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVTVIECSSVTjPELANRTRLFSVCDERWAIDDLYPKIYHSCFFIVTYLAPLGLMT^MAYFQIFR 240 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKT AKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AAN PIT YN F 360 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFAL C Y L P I S VLN VLKRVFGM F RQAS D REAVYAC FT F S HW LVYAN S AAN P 1 1 YN F 360 



Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 6 

US-08-462-509B-2 

; Sequence 2, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; NUMBER OF SEQUENCES: 12 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Human Genome Sciences, Inc. 

; STREET: 9410 Key West Avenue 

; CITY: Rockiville 

STATE: MD 

COUNTRY: USA 

ZIP: 20850 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/462, 509B 

FILING DATE: 05-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/ 05616 

FILING DATE: 05-MAY-1995 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Wales, Michele M. 

REGISTRATION NUMBER: 43,975 
; REFERENCE/ DOCKET NUMBER: PF168P1 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 
; TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 402 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-462-509B-2 

Query Match 96.8%; Score 1875; DB 4; Length 402; 

.Best Local Similarity 98.6%; Pred. No. 9.4e-185; ; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 ME P SAT P GAQMGVP P GS RE P S P VP P D YE D E FL RYLWRD YL Y P KQ Y EWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIAWRNHHMRTWNYFIW 120 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 VI P YLQAVSVSVAVLTLS FIALDRWYAICHPLLFKSTARRARGS I LGIWAVSLAIMVPQA 180 

Qy 181 AVMQSSSVLPEIJ^RTRLFSLCHERWADDLYPKIYHSCFFIVTYIAPLGLMAMAYFQIFR 240 

|||: I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPEIANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

| I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I II I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7VRRKTAKML 300 

Qy 301 MWL LVFALC Y L P I S VLNVL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 301 MVVLLVFALCYLPI S VLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANPI I YNF 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 7 

PCT-US95-05616-2 

; Sequence 2, Application PC/TUS9505616 
; GENERAL INFORMATION : 

APPLICANT: LI, ET AL. 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

ADDRESSEE: CECCHI, STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
; CITY: ROSELAND 

STATE: NEW JERSEY 
COUNTRY: USA 
; ZIP: 07068 

COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 
; FILING DATE: concurrently 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
; NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 
; REFERENCE/DOCKET NUMBER: 325800-268 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 
; TELEFAX: 201-994-1744 

; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 402 AMINO ACIDS 

TYPE: AMINO ACID 
; STRANDEDNESS: 

TOPOLOGY: LINEAR 



MOLECULE TYPE: PROTEIN 
PCT-US95-05616-2 



Query Match 96.8%; Score 1875; DB 5; Length 402; 

Best Local Similarity 98.6%; Pred. No. 9.4e-185; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I LI 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNT LVCLAVWRNHHMRT VTN Y F I VN L S LADVLVT AI C L PAS L LVD I T E S WL FGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LVGNTLVCLAWRNHHMRTWNYFIWLSLADVXWAICLPASLLVDITESWLFGHALCK 120 

VI P Y LQ AVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAI MVP Q A 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
VI P YLQAVSVS VAVLT L S F I AL D RW YAI CH P L L FKS TARRARG S I L G I WAVS LAIMVPQ A 18 0 

AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLM7^[AYFQIFR 24 0 

Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I M 

AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

KLWGRQI PGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I | | | | | | I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I 
KLWGRQI PGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

MWLLVFAL C YL P I S VLN VL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 



I I I 



RESULT 8 
US-08-846-704-2 

Sequence 2, Application US/08846704 
Patent No. 6020157 
GENERAL INFORMATION: 

APPLICANT: BERGSMA, DERK J. 
APPLICANT: ELLIS , CATHERINE E. 
TITLE OF INVENTION: NOVEL G- PROTEIN COUPLED 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS-:* 

ADDRESSEE: RATNER & PRESTIA 
STREET: P.O. BOX 980 
CITY: VALLEY FORGE 
STATE: PA . 
COUNTRY: USA 
ZIP: 19482 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 



; CURRENT APPLICATION DATA: 

; - APPLICATION NUMBER:- US/08/84 6, 704 

FILING DATE: 30-APR-1997 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 
; NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 

REFERENCE/ DOCKET NUMBER: GH-70002 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 } 

TELEFAX: 610-407-0701 
; TELEX: 846169 

; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 425 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-846-704-2 

Query Match 96.8%; Score 1875; DB 3; Length 425; 

Best Local Similarity 98.6%; Pred. No. le-184; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP P D YEDE FLRYLWRD YL YP KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSIADVLWAICLPASLLVDITESWLFGHALCK 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVLWAICLPASLLVT)ITESWLFGHALCK 120 

Qy 121 VI P YLQAVS VS VAVLT L S FI P L D RW YAI CH P L L FK S TARRARG S I L GI WAVS LAIMVPQA 180 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VI PYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR7VRGSILGIWAVS LAIMVPQA 180 

Qy 181 AVMQSSSVIjPELANRTRLFSLCHERWADDLYPKIYHSC^ 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I 
Db 181 AWIECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIvTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVl^WKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I ii 1 1 1 1 1 1 1 I 1 1 1 1 I I I I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 Ml I I I I 

Db 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWL LVFAL C YL P I S VLN VLKRVFGMFRQAS D REAVYAC FT F S HW LVYAN S AAN P 1 1 YN F 360 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFAL C Y L P I S VLNVL KRVFGMFRQ AS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 9 

US-09~211^823C-22 

Sequence 22, Application US/09211823C 
Patent No. 6664229 
GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 

TERRETT, JONATHAN ALEXANDER 
UPTON, NEIL 
PIPER, DAVID 
SMITH, MARTIN IAN 
KENNETT, GUY ANTHONY 
PATEL, SARASWATI R. 

METHODS OF TREATMENT USING NOVEL LIGANDS 
OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 



APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
TITLE OF INVENTION: 
TITLE OF INVENTION: 



TITLE OF INVENTION: ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/09/211, 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 22 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-211-823C-22 



Query Match 96.8%; 
Best Local Similarity 98.6%; 
Matches 358; Conservative 



Score 1875; DB 4; 
Pred. No. le-184; 
2; Mismatches 3; 



Length 425; 



Indels 



0; Gaps 



0; 



Qy 

Db 

Qy 

Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

61 L VGNT L VC LAVWRNHHMRT VTN Y F I VN L S LAD VLVT AI C L PAS L LVD I T E S WL FGHALC K 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

61 LVGNTLVCIAVTtfRNHHMRTVTNYFIWLSLMVLW 12 0 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



121 



121 



181 



VI P Y LQAVS VS VAVLT L S F I P L DRW YAI CHPLLFKS T ARRARG S I LG I WAVS LAI MVP QA 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 



180 



180 



240 



AVMQSSSVXPEI^RTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 

III: I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

241 KLWGRQ I P GTT S ALVRNWKRP S DQ L G D LEQ GL S GE P Q P RGRAFLAEVKQMRARRKT AKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I 
241 KLWGRQIPGTT SAL VRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKT AKML 300 

301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 MWL LVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 



Qy 

Db 



361 LSG 363 
I I I 

361 LSG 363 



RESULT 10 
US-08-462-509B-6 

; Sequence 6, Application US/08462509B 
; Patent No. 6410701 
; GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences , Inc. 

; STREET: 9410 Key West Avenue 

; CITY: Rockiville 

STATE: MD 

COUNTRY: USA 
; ZIP: 20850 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE:. Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/4 62 , 509B 
; FILING DATE: 05-JUN-1995 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/05616 

FILING DATE: 05-MAY-1995 
ATTORNEY/AGENT INFORMATION: 

NAME: Wales, Michele M. 

REGISTRATION NUMBER: 43,975 

REFERENCE/DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 372 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-462-509B-6 

Query Match 96.6%; Score 1872; DB 4; Length 372; 

Best Local Similarity 98.3%; Pred. No. 1.7e-184; 

Matches 357; Conservative 3; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP P D YE DE FL R YLWRD YL Y P KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ME P SAT P GAQMGVP P GS RD P S PVP P D YEDEFLRYLWRD YL Y P KQ YEWVL I AAYVAVFWA 60 



Qy 



61 LVGNT LVCLAVWRNHHMRT VTN Y F I VN L S LADVLVT AI C L PAS L LVD I T E S WL FGHALCK 120 



Db 



61 



I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

LVGNTLVCIAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



QY 



121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 





Db 



121 VI P YLQAVSVSVAVLTLS FI ALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAIMVPQA 180 



Db 



181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYIAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYH'SCFFIWYLAPLGLMAMAYFQIFR 240 



Qy 



241 KLWGRQIPGTTSALVTOWKRPSDQLGDLEQGLSGEPQPRGRAFIJVEVKQMRARRKTAKML 300 




Db 



241 KLWGRQI PGTT S ALVRNWKRP S DQLGDLEQGLS GEPQ PRGRAFLAEVKQMRARRKT7VKML 300 



Qy 



Db 



301 MVVLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAWACFT FSHWLVYANSAANP 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 MWL LVFALC Y L P I S VLNVLKRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 



Qy 



Db 



361 LSG 363 
I I I 

361 LSG 363 



RESULT 11 
US-08-846-704-4 

; Sequence 4, Application US/08846704 

; Patent No. 6020157 

; GENERAL INFORMATION: 

; APPLICANT: BERGSMA, DERK J. 

APPLICANT: ELLIS, CATHERINE E. 
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 
NUMBER OF SEQUENCES : 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: RATNER & PRESTIA 
STREET: P.O. BOX 980 
CITY: VALLEY FORGE 
; STATE: PA 

COUNTRY: USA 
ZIP: 19482 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

; SOFTWARE: FastSEQ for Windows Version 2.0 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846, 704 

FILING DATE:. 30-APR-1997 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 

FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 

NAME: PRESTIA, PAUL F 
; REGISTRATION NUMBER: 23,031 

; REFERENCE/DOCKET NUMBER: GH-70002 

; TELECOMMUNICATION INFORMATION: 



TELEPHONE: 610-407-0700 
; TELEFAX: 610-407-0701 

; TELEX: 846169 

; INFORMATION FOR SEQ ID NO: 4: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 4 02 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-846-704-4 

Query Match 96.5%; Score 1869; DB 3; Length 402; 

Best Local Similarity 98.3%; Pred. No. 3.9e-184; 

Matches 357; Conservative 2; Mismatches 4; Indels .0; Gaps 0; 



Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP P D YE D E FL RYLWRD Y L Y P KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSIJUDV^ 120 



Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CH P L L FK S T ARRARG S I L GI WAVS LAI MVPQ A 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIP YLQAVS VSVAVLTLSFIALDRWYAICHPLLFKSTARRARGS I LGIWAVS LAI MVPQA 180 

Qy 181 AVTXIQSSSVLPEl^RTRLFSLCHERWADDLYPKIYHSCFFIWY^PLGLMAMAYFQIFR 240 

ML: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 24 0 



Qy 241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFAL C Y L P I S VLNVLKRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWLLVFAL C Y L P I S VLNVLKRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 12 
US-09-479-128-2 

; Sequence 2, Application US/09479128 
; Patent No. 6319710 
; GENERAL INFORMATION: 

; APPLICANT: Berglind Ran Olafsdottir 
; APPLICANT: Jeffrey Gulcher 

; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 

; FILE REFERENCE: 234 5.1005-001 

; CURRENT APPLICATION NUMBER: US/09/479,128 

; CURRENT FILING DATE: 2000-01-07 

; PRIOR APPLICATION NUMBER: US 09/379,083 



; PRIOR FILING DATE: 1999-08-23 
; NUMBER OF SEQ ID NOS : 22 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 2 

LENGTH: 425 
; TYPE: PRT 

ORGANISM: Homo Sapiens 
US-09-479-128-2 

Query Match 96.5%; Score 1869; DB 3; Length 425; 

Best Local Similarity 98.3%; Pred. No. 4.2e-184; 

Matches 357; "Conservative 2; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 

Db 1 

Qy 61 

Db 61 

Qy 121 

Db 121 

Qy 181 

Db 181 

Qy 241 

Db 241 

Qy 301 

Db 301 

Qy 361 

Db 361 



RESULT 13 
US-09-426-290-2 

; Sequence 2, Application US/09426290 
; Patent No. 6410712 
; GENERAL INFORMATION: 

; APPLICANT: Berglind Ran Olafsdottir 
; APPLICANT: Jeffrey Gulcher 

; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 

; FILE REFERENCE: 2345.2 001-000 

; CURRENT APPLICATION NUMBER: US/ 09/426, 290 

; CURRENT FILING DATE: 1999-10-25 

; NUMBER OF SEQ ID NOS: 24 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 2 

LENGTH: 4 44 



MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I LI I I I I 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

LVGNT LVCLAVWRNHHMRT VTN YFI VNL S LADVLVT AI C L PAS LLVD I T E SWL FGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LVGNTLVCLAWRNHHMRTWNYFIWLSL7VDVLVTAICLPASLLVDITESWLFGHALCK 120 

VI P YLQAVSVS VAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAI MVP Q A 180 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

VI PYLQAVSVSVAVLTLSFIALDRWYAI CHPLLFKSTARRARGS I LGIWAVS LAI MVPQA 180 

AVMQSSSVXPEL7\NRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AVMECSSVLPEIANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

KLWGRQIPGTTSALVWWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AANP 1 1 YN F 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MWL LVFALC Y L P I S VLNVL KRVFGMFRQAS DREAVYAC FT F S HW LVYAN S AAN P 1 1 YN F 360 

LSG 363 
I I I 

LSG 363 



TYPE: PRT 

ORGANISM: Homo Sapiens 
US-09-426-290-2 

Query Match 67.8%; Score 1313.5; DB 4; Length 444; 

Best Local Similarity 70.0%; Pred. No. 7.8e-127; 

Matches 250; Conservative 37; Mismatches 49; Indels 21; Gaps 3; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWLIAAWAVBWALVGNTLVCIAVWRNH 75 

:: I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTWNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVX 135 

I I I I I I II I I I I I I I I I I I I I I I I I : I : I I I I I : I I I : I I I I I I I I I I I I I I : I I 
Db 84 HMRTWNYFIWLSIADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 14 3 

Qy 136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVTy[QSSSVLPELANR 195 

III I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I : I I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203 

Qy 196 TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : : I III : : I I I : I I I I I : I I I : I I I II : I I I I I I I I I I I I I I I : I : 

Db • 204 TTLFTVCDERWGGEIYPKMYHICFFLWYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAE VKQMRARRKT AKMLMVVL L V 306 

III I I I I I I I I : I I : I I I I I I I : I I I I I I I I 

Db 264 RKWK PLQPVSQPRGPGQPTKSRMSAVAAEIKQIRARRKTARMLMWLLV 312 

Qy 307 FALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN SAAN P 1 1 YN FL S G 363 

II : I I I I I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I 

Db 313 FAICYLPISILNVLKRVFGMFAHTEDRETWAWFTFSHWLVYANSAANPIIYNFLSG 369 



RESULT 14 
US-09-119-788-2 

; Sequence 2, Application US/09119788 
; Patent No. 6166193 
; GENERAL INFORMATION: 

APPLICANT: Yanagisawa, Masashi 

TITLE OF INVENTION: CDNA CLONE MYl THAT ENCODES 

TITLE OF INVENTION: A NOVEL HUMAN 7 -TRANSMEMBRANE RECEPTOR 

NUMBER OF SEQUENCES: 2 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: SmithKline Beecham Corporation 
STREET: 709 Swedeland Road 
; CITY: King of Prussia , - 

; STATE : PA 

; COUNTRY: United States of America 

; ZIP: 19406 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/119, 788 
; FILING DATE: 21-JUL-1998 



CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/053,790 
FILING DATE: 25-JUL-1997 
ATTORNEY/AGENT INFORMATION: 
NAME: King, William T 
REGISTRATION NUMBER: 30,954 
; REFERENCE/ DOCKET NUMBER: GH50029 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 610-270-5515 
; TELEFAX: 610-270-5090 

TELEX : 

; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 444 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-119-788-2 



Query Match 67,8%; Score 1312.5; DB 3; Length 444; 

Best Local Similarity 70.0%; Pred. No. 9.9e-127; 

Matches 250; Conservative 37; Mismatches 49; Indels 21; Gaps 3; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWLIAAWAVFWALVGNTLVCIAWRNH 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVXIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I III I I I I I I I I I II : I : I I I I I : I II : I I I I I I I I I I I I I I : I I 
Db 84 HMRTWNYFIWLSIJVDVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 



Qy 136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 

III I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I : I I III: 
Db 144 TLSCIALDRWYAICHPIJ^FKSTAKRT^NSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203 



Qy 196 TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : : I III : : I I I : I I I I I : I I I : I I I II : I I I I I I I I I I I I I I I : I : 
Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFL7VEVKQMRARRKTAKMLMWLLV 306 

III I I I I I I I I : I I : I I I I I I I : I I I I I I I I 

Db 2 64 RKWK P LQ P VS QPRGPGQPTKS RMGAVAAE I KQ I RARRKT ARMLMWL LV 312 

Qy 307 FALCYLP I S VXNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN FL S G 353 

I I : I I I I I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I 
Db 313 FAI C YL P I S I LNVTiKRVFGMFAHT EDRETVYAWFT FS HWLVYAN S AAN P 1 1 YN FL S G 369 



RESULT 15 
US-08-513-974B-54 

; Sequence 54, Application US/08513974B 

; Patent No. 6114139 

; GENERAL INFORMATION: 

APPLICANT: Hinuma, Shuji 
APPLICANT: Hosoya, Masaki 



APPLICANT: Fujii, Ryo 
APPLICANT: Ohtaki, Tetsuya 
APPLICANT: Fukusumi, Shoji 
APPLICANT: Ohgi, Kazuhiro 

TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN, 
TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
NUMBER OF SEQUENCES: 380 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DIKE, BRONSTEIN, ROBERTS & CUSHMAN, LLP 

STREET: 130 Water Street 

CITY: Boston 

STATE: MA 

COUNTRY: USA 

ZIP : 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version. #1 . 30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/513 , 974B 

FILING DATE: 14-SEP-1995 

CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/JP95/ 01599 

FILING DATE: 10-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-093989 

FILING DATE: 19-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-057186 

FILING DATE: 16-MAR-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-007177 

FILING DATE: 20-JAN-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-326611 

FILING DATE: 28-DEC-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-270017 

FILING DATE: 02-NOV-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236357 

FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA: 

-APPLICATION NUMBER: JP 6-236356 - vr - 

FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189274 

FILING DATE: ll-AUG-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189273 

FILING DATE: ll-AUG-1945 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189272 

FILING DATE: ll-AUG-1994 
ATTORNEY/AGENT INFORMATION: 



; NAME: Resnick, David S. 

; REGISTRATION NUMBER: 34,235 

; REFERENCE/ DOCKET NUMBER: 45753 

TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 617-523-3400 

; TELEFAX: 617-523-6440 

; INFORMATION FOR SEQ ID NO: 54: 

; SEQUENCE CHARACTERISTICS: 

; LENGTH: 263 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: 

; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 
US-08-513-974B-54 

Query Match 66.0%; Score 1279; DB 3; Length 263; 

Best Local Similarity 94.3%; Pred. No. 1.4e-123; 

Matches 248; Conservative 4; Mismatches 11; Indels 0; Gaps 0; 



Qy 91 AD VLVT AI C L PAS L LVD I T E S WL FGHAL C KVI P Y LQAVS VS VAVLT L S F I P L D RW YAI CH 150 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1 ADVLVTAI CLPAS LLVDI TESWLFGHALCKVI P YLQAVS VSVWLTLS S I ALDRWYAI CH 60 

Qy 151 PLLFKSTARRAJ^GSILGIWAVSLAIMVPQAAVTCQS 210 

I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I I I I I I I I I I I I I : I I I I I I II 
Db 61 PLLFKSTARRARGSILGIWAVSLAVWPQAAWIECSSVLPELANRTRLLSVCDERWADDL 120 

Qy 211 YPKIYHSCFFIWYLAPLGLM7^YFQIFRKLWGRQIPGTTS7\LVRNWKRPSDQLGDLEQ 270 

I I I I I I I I I U I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 YPKI YHSCFFIVTYLAPLGLMAMAYFQI FRKLWGRQI PGTTSALVRNWKRPSDQLDDQGQ 180 

Qy 271 GLSGEPQP RGRAFLAE VKQMRARRKT AKMLMVV^ L VFALC YL P I S VLN VL KRVFGM FRQA 330 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GLSSEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQA 240 

Qy 331 S D REAVYAC FT F S HWLVYAN S AA 353 

I I II I : I I I I I I I I II I I I I I I I 
Db 241 S D REAI YAC FT F S HWLVYAN S AA 263 



Search completed: October 14, 2004, 10:58:12 
Job time : 23.4691 sees 



GenCore version - 5 . 1 . 6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein — protein search, using sw model 
Run on : 



October 14, 2004, 10:42:40 ; Search time 19.6209 Seconds 

(without alignments) 
1809.496 Million cell updates/sec 



Title: US-10-07 0-532-4 

Perfect score: 1937 

1 ME P SAT P GAQMGVP P G S RE P NSAANPIIYNFLSGLPWSLL 369 



Sequence : 
Scoring table 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283416 



Database : 



PIR_79:* 
pirl : * 
pir2:* 
pir3:* 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 
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1 
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neurokinin 3 recep 
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19.5 
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11 
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19.3 
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12 
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19.2 
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2 
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gastrin receptor - 


13 
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19.2 
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2 


159336 


galanin receptor 1 



14 
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1 
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2 
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15 
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16 


370.5 
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8 
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18 . 


7 
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28 
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18 . 


4 
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2 


S23510 


n pi l role i n i n 1 rprpn 


29 
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18 . 


3 


407 


1 


JQ1274 


llCUJlUJVUlXil X LCUCp 






1 ft 


9 
z. 


4 ft 0 

*i O VJ 


9 

Z 


1 JJU JO 


ueLci x ciu.x7ericxgxc 


31 
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18 . 


2 


423 


2 


JC7677 


dXXa.UvJoL.ClL.Xll ICL-C|J 
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i 

X 


4 D7 




A14^S7 


neuroKinin x xecep 


33 
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X <J • 


i 
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•J J7 T 


2 

z 




n 1 ati i n rpfpirhnr — 

LjClXclllXll ICLCty LUX 


34 
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18 . 


1 
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1 
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X 
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Z 




neuroKinxn x xecep 


36 
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o 
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2 


jf S S 9 9 
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LlIUXCL-yO LU JS.X11XI1 / V 
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1 7 
x / • 


Q 

Z) 
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O U T 


9 

z 
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ii" U.XLJ jvxiixii z. ictcp 
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^4 6 S 


1 7 
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Q 


4 4^ 


9 

Z 


D4 0470 


yXULUUUi. UXtUXU XilU 


39 
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17 . 


9 
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2 


A4 1783 


T'^r'H^/lri ni n rprprifo 

LdL-liy JTwXllXll ICL-Cp 


40 
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17 . 


8 
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2 


B41007 


tjomlDPsin rprpntor 

vyxLUk> v — o _i_ xx -i— ^ — k> o w J— ^ 


41 
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17. 


6 
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2 


A25896 


beta-adrenergic re 


42 
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17. 


5 
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2 


S36794 


beta-l-adrenergic 


43 
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17. 


4 
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2 


S12591 


beta-l-adrenergic 


44 
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17. 


3 


398 


2 


JN0708 


thyrotropin- releas 


45 


336 


17. 


3 


411 


2 


156444 


thyrotrophin-relea 



ALIGNMENTS 



RESULT 1 
A41738 

neuropeptide Y receptor - fruit fly (Drosophila melanogaster ) 
N;Alternate names: G protein-coupled receptor PR4 
C; Species: Drosophila melanogaster 

C;Date: 16-Sep-1992 #sequence_revision 16-Sep-1992 #text_change 09-Jul-2004 
C; Accession: A41738 

R; Li, X.J.; Wu, Y.N.; North, R.A. ; Forte, M. 
J. Biol. Chem. 267, 9-12, 1992 

A; Title : Cloning, functional expression, and developmental regulation of a 

neuropeptide Y receptor from Drosophila melanogaster. 

A; Reference number: A41738; MUID : 92 112730 ; PMID: 1370455 

A; Access ion: A41738 

A;Molecule type: mRNA 

A; Residues: 1-449 <LIA> 

A;Cross-references: UNIPROT : P25931; GB:M81490; NID:gl57996; PIDN : AAA28727 . 1 ; 

PID:gl57997 

C; Genetics: 

A; Gene : FlyBase : NepYr 

A; Cross-references : FlyBase : FBgn0004842 



C; Super family : neurokinin 1 receptor 

C; Keywords: appetite; G protein-coupled receptor; transmembrane protein 



Query Match 21.8%; Score 422.5; DB 2; Length 449; 

Best Local Similarity 30.4%; Pred. No. le-28; 

Matches 105; Conservative 59; Mismatches 12 6; Indels 55; Gaps 9; 

Qy 26 DYEDEFLRYLWRDYLYPKQYEWVLIAAWAVFVVALVGNTLVCLAWRNH 85 

II: I I : : :: I : : I : I I : I I II I : I I I I I I I I I 

Db 75 DYD-: LLSEDMWSSAYFKIIVYMLYIPIFIFALIGNGTVCYIVYSTPRMRTVTNYFI 129 

Qy 86 VN L S LADVLVT AI C L PAS L L- VD I T E S WL FGHAL C KYI P YLQAVS VS VAVLT LSFIPLDR 144 

: I : : I : I : : I I : I : : I . I I I I I I : I I I I I I I : II I : I I 
Db 130 ASLAIGDILMSFFCEPSSFISLFILNYWPFGLALCHFVNYSQAVSYLYSAYTLVAISIDR 189 

Qy 145 WYAICHPLLFKSTARRARGSILGIWAYSIAIMVPQAAViy[QSSSVXPELANRTRLFS-LCH 2 03 

: II II : I I I | |:| ::|| :| II :| |: :| 

Db 190 YIAIMWPLKPRITKRYATFIIAGVWFIALATALPIPIV — SGLDIPMSPWHTKCEKYICR 247 

Qy 204 ERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVl^NWKRPSD 263 

II I | : :: | | | :: I : I : : | :: | | : | 

Db 24 8 EMWPSRSQEYYYTLSLFALQFWPLGVLIFTYARITIRVWAKRPPGEA ETNRD 300 

Qy 264 QLGDLEQGLSGEPQPRGRAFIAEYKQMR7VRRKTAKMLMVVLLVFALCYLPISVLNYLKRY 323 

I : I : : I I I I : : I : : I I I : I I : : I : I 
Db 301 Q RMARS KRKMYKMMLT VVI VFT CCWL P FN I LQL L 334 

Qy 324 FGMFRQAS D REAV YAC FT FS HWLYYANS AAN P 1 1 YN FL S 362 

: I I . I I I I I I :: I I I I I ::: 

Db 335 LNDEEFAHWDPLPYVWFAF-HWLAMSHCCYNPIIYCYMN 372 



RESULT 2 
152315 

G protein-coupled receptor UHR-1 - rat 
C; Species: Rattus sp. (rat) 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 19-May-2000 
C; Accession: 152315 

R;Welch, S.K.; O'Hara, B.F.; Kilduff, T.S.; Heller, H.C. 
Biochem. Biophys . Res. Commun. 209, 606-613, 1995 

A; Title: Sequence and tissue distribution of a candidate G-coupled receptor 
cloned from rat hypothalamus . 

A;Reference number: 152315; MUID: 95251659 ; PMID:7733930 
A;Accession: 152315 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-370 <RES> 

A;Cross-references: GB:S77867; NID:g998527; PIDN:AAB34129 . 1; PID:g998528 
C; Superf amily : neurokinin 1 receptor 

Query Match 21.2%; Score 411.5; DB 1; Length 370; 

Best Local Similarity 30.9%; Pred. No. 7.5e-28; 

Matches 99; Conservative 62; Mismatches 118; Indels 41; Gaps 5; 

Qy 44 Q YEWYL I AAYVAYFWALYGNT LYC LAVWRNHHMRT VTN Y FI VN L S LADVLVTAI C L PAS 103 

I : : : : I I I I I I I I I : I : I : I I I : I I I : I : I I I : I I : I : 
Db 58 Q L KGL I YML Y S I VYWG L YGN C L L VL YI ARVRRLHNVTN F L I GN LAL S D YLMCAAC YP LT 117 



Qy 104 LLVDI-TESWLFGHALGKVIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRAR 162 

I I : I I I I : : : I I I : I I : I I I : I : I I : : I I I : : : : 

Db 118 LAYAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRYWLVHPLRRRISLKLSA 177 

Qy 163 GSILGIWAVSLAIMVPQAAVMQSSSVLPELANRTRLFSLCHERW-ADDLYPKIYHSCFFI 221 

: : I I I I I : I : : I I : I MM:: : I I : 

Db 178 YAVL G I WAL S AVLAL P AAVHT YH VE L K P H D VR LCEEFWGSQERQRQIYAWGLLL 231 

Qy 222. WYIAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRIWKRPSDQLGDLEQGLSGEPQPRGR 281 

I I I I I : : : I : : II I : I I : : : I I 
Db 232 GTYLLPLLAILLSYVRVSVKLRNRWPGSVTQSQADWDR 270 

Qy 282 AFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFT 341 

I I : I : I : I I :: I I I I I : I I : : I : I : : : I I I 
Db 271 ARRRRTFCLLWWWFALCWLPLHIFNLLR DLDPRAIDPYAFGLVQL 318 

Qy 342 FSHWLVYANSAANPIIYNFL 361 

I I I ::: I I I I : I 

Db 319 LCHWLAMS S AC YN P FI YAWL 338 



RESULT 3 
139187 

neuropeptide Y/peptide YY receptor Y2 - human 

N; Alternate names: neuropeptide y/peptide YY receptor type 2 

C; Species: Homo sapiens (man) 

C;Date: Ol-Mar-1996 #sequence_revision Ol-Mar-1996 #text_change 09-Jul-2004 
C;Accession: 139187; 139163; G02301 
R;Gerald, C.; Walker, M.W. ; Vaysse, P.J. 
J. Biol. Chem. 270, 26758-26761, 1995 

A;Title: Expression cloning and pharmacological characterization of a human 

hippocampal neuropeptide Y/peptide YY Y2 receptor subtype. 

A;Reference number: 139187; MUID : 96070760 ; PMID:7592910 

A;Accession: 139187 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-381 <GER> 

A;Cross-references : UNIPROT : P49146; EMBL : U36269 ; NID : gl063633 ; PIDN: AAC50281 . 1 ; 
PID:gl063634 

R;Rose, P.M.; Fernandes, P.; Lynch, J.S.; Frazier, S.T.; Fisher, S.M.; Kodukula, 

K. ; Kienzle, B. ; Seethala, R. 

J. Biol. Chem. 270, 22661-22664, 1995 

A; Title: Cloning and functional expression of a cDNA encoding a human type 2 
neuropeptide Y receptor. 

A; Reference number: 139163; MUID: 96032678 ; PMID : 7559383 
A;Accession: 139163 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A;Residues: 1-133, 1 A* , 135-381 <ROS> 

A;Cross-references: EMBL:U32500; NID : gl000750 ; PIDN : AAA93170 . 1 ; PID:gl000751 
R;Yan, H.; Yang, J.; Marasco, J.; Yamaguchi, K. ; Brenner, S.; Collins, F. ; 
Karbon, W. 

submitted to the EMBL Data Library, December 1995 
A; Reference number: H01019 
A; Accession: G02301 

A; Status: preliminary; translated from GB/EMBL/DDBJ 



A; Molecule type: mRNA 

A; Residues: 1-171, 'G',173, 'R 1 , 175-201, * P 1 , 203-208 , 1 A 1 , 210-381 XYAN> 

A; Cross-references: EMBL:U42389; NID: gl314329 ; PIDN : AAB07760 . 1 ; PID:gl314330 

C; Genetics : 

A; Gene: GDB : NPY2R 

A; Cross-references: GDB: 4365607; OMIM: 162642 

A;Map position: 4q31-4q31 

C; Super family : neurokinin 1 receptor 

C; Keywords: appetite; G protein-coupled receptor; glycoprotein; lipoprotein; 

thiolester bond; transmembrane protein 

F; 49-76/Domain : transmembrane #status predicted <TM1> 

F; 87-113/Domain: transmembrane #status predicted <TM2> 

F; 16 6-1 8 6/ Domain : transmembrane #status predicted <TM4> 

F; 22 1-2 37 /Domain : transmembrane #status predicted <TM5> 

F;269-291/Domain: transmembrane #status predicted <TM6> 

F; 305-32 8 /Domain : transmembrane #status predicted <TM7> 

F; 123-2 03/Disulfide bonds: #status predicted 

F; 342/Binding site: palmitate (Cys) (covalent) istatus predicted 
F; 372/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 20.2%; Score 392; DB 2; Length 381; 

Best Local Similarity 27.6%; Pred. No. 3.7e-26; 

Matches 101; Conservative 70; Mismatches 129; Indels 66; Gaps 9; 



Qy 


3 


P S AT PGAQMGVP PGS RE P S P VP P D YED E FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWALV 
III:: Mil: 1 : : 1 1 1 1 1 : : : : : : 


62 






\j \j 


Qy 


63 


GNT L VC LAW RNHHMRT VTN Y F I VN L S LAD VL VT AI C L PAS L LVD I T E S W L FGHAL C KVI 

1 1 : 1 1 1 : 1 1 1 1 1 1 : 1 1 1 1 :: 1 1 : 1 1 : 1 1 1 : 1 . : 1 1 1 1 : : 
GNSLVIHWIKFKSMRTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLCHLV 


122 


Db 


67 


126 


Qy 


123 


P YLQAVS VS VAVLT L S FI P LD RWYAI CH PLL FKSTARRARGS I LGI - WAVS LAI MVPQAA 
1 1 1 : : 1 1 : : 1 1 : 1 1 1 1 1 : 1 1 : : 1 I : I : I : I : I I 
P YAQGLAVQVSTITLTVIALDRHRCIVYHLESK- 1 SKRI SFLI IGLAWGI SALLASPLAI 


181 


Db 


127 


185 


Qy 


182 


VMQSS— SVLPELANRTRLFSLCHERWADD LYPKIYHSCFFIVTYLAPLGLMAMAYF 

: 1 : : 1 : 1 hi : : 1 : 1 : : | : | | | : : : : | 
FREYSLIEIIPDFE IVACTEKWPGEEKSIYGTVYSLSSLLILYVLPLGIISFSYT 


236 


Db 


186 


240 


Qy 


237 


QIFRKLWGRQIPGTTSALVl^WKRPSDQLGDLEQGLSGEPQPRGRAFI^VKQMRARRKT 

: 1 : 1 1 II 1 1 : 1 1 
RIWSKLKNHVSPGA ANDHYHQRRQKT 


296 


Db 


241 


266 


Qy 


297 


AKMLMWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT F S HWLVYAN SAAN P I 
III: 1 : n : : 1 1 1 : : : I I : III: : : 1 1 1 : . 
TKMLVCWWFAVSWLP LHAFQLAVDIDSQVLDLKEYKLIFTVFHIIAMCSTFANPL 


356 


Db 


267 


323 


Qy 


357 


IYNFLS 362 




Db 


324 


: I ::: 
LYGWMN 329 




RESULT 4 

S50150 

gastric 


CCK-A 


receptor - rabbit 





C; Species: Oryctolagus cuniculus (domestic rabbit) 



C;Date: 14-Jul-1995 #sequence_revision 21-Jul-1995 #text__change 20-Apr-2000 
C;Accession: S50150 

R;Reuben, M. ; Rising, L.; Prinz, C; Hersey, S.; Sachs, G. 
Biochim. Biophys . Acta 1219, 321-327, 1994 

A; Title: Cloning and expression of the rabbit gastric CCK-A receptor. 

A; Reference number: S50150; MUID: 95002144 ; PMID:7918628 

A; Access ion: S50150 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-427 <REU> 

C; Super family : neurokinin 1 receptor 

Query Match 20.2%; Score 392; DB 2; Length 427; 

Best Local Similarity 28.8%; Pred. No. 4.2e-26; 

Matches 110; Conservative 78; Mismatches 150; Indels 44; Gaps 11; 
Qy 8 GAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFWALVGN 64 



Db 9 GNASGIPP PCELGLDNETLFCLDQP PPSKEWQPAVQILLYSLIFLLSVLGN 59 

Qy 65 TLVCLAWRNHHMRTWNYFIWLSIJ^VLWAICLPASLLVDITESWLFGHALCKVIPY 124 

III : I I I I I I i I I : : : I : : : I : : : I : I : I :::::: | | | | | | | 

Db 60 TLVITVLIRNKRMRTVTNI FLLSLAI SDLMLCLFCMPFNLI PNLLKDFI FGSALCKTTTY 119 

Qy 125 LQ AVS VS VAVLT L S F I P L D RW YAI CHPLLFK — S TARRARG S I L G I WAVS LAI MVP QAAV 182 

I I I I I : I I I I : I : II I I I : I I I I : I I I I I 

Db 120 LMGT S VS VSTLNLVAI S LERYGAI CKPLQS RWQTKSHALKVI AATWCLS FAIMT P YP I Y 179 

Qy 183 MQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKL 242 



Db 180 SNLVPFTKTNNQTANMCRFLLPSDVMQQAWHTFLLLILFLIPGIVMMVAYGMISLEL 236 

Qy 243 W- GRQ I P GTT S ALVRNWK RPSDQLGDLEQGLSGEPQPRGRA 282 

: I : : : I : I : I I : I : I I I I I 

Db 237 YQGIKFDASQKKSAKERKASTGSGRFEDNDGCYLQRSKPTRQL-ELQQ-LSGGGGGRVSR 294 

Qy 283 F — LAEVKQMRARRKTAKMLMWLLVFALCYLP I SVLNVLKRVFGMFRQAS DREAVYACF 340 

: : | : : : : I I I I : : : : I I I : : I I I : : : : : I 
Db 295 I HS S S SAAALMAKKRVI RMLMVI WLFFLCWMP I FSANAWRAYDTV SAERRLSGTPI 351 

Qy 341 TFSHWLVYANSAANPIIYNFLS 362 

: I 11:1 I I I I I I : : 
Db 352 SFILLLSYTSSCVNPIIYCFMN 373 



RESULT 5 , ■ ' 

JC7809 

sulfakinin receptor protein, DSK-R1 - fruit fly (Drosophila melanogaster ) 
C; Species: Drosophila melanogaster 

C;Date: 03-Jun-2002 #sequence_revision 03-Jun-2002 #text_change 09-Jul-2004 
C; Access ion: JC7809 

R;Kubiak, T.M.; Larsen, M.J. ; Burton, K.J.; Bannow, C.A.; Martin, R.A. ; 
Zantello, M.R.; Lowery, D.E. 

Biochem. Biophys. Res. Commun. 291, 313-320, 2002 

A; Title: Cloning and functional expression of the first Drosophila melanogaster 
sulfakinin receptor DSK-R1 . 

A; Reference number: JC7809; PMID: 11846406; MUID: 21835488 



A; Accession: JC7809 
A; Molecule type: mRNA 
A; Residues: 1-584 <KUB> 

A;Cross-references: UNIPROT : Q7M3 J6; GB:AX128640 

C; Comment: This receptor, the first functionally active orphan Drosophila 
sulfakinin G-protein-coupled receptor, with seven transmembrane domains, has the 
possible^roles in insect brain and/or gut functions. 
C; Genetics : 
A;Gene: dsk-rl 
A;Map position: 17 

F; 115-139/Domain: transmembrane region ((status predicted <TMR1> 
F; 149-167/Domain: transmembrane region #status predicted <TMR2> 
F; 189-207/Domain : transmembrane region #status predicted <TMR3> 
F; 229-250/Domain: transmembrane region #status predicted <TMR4> 
F;275-300/Domain: transmembrane region #status predicted <TMR5> 
F; 4 31-4 5 4 /Domain : transmembrane region #status predicted <TMR6> 
F; 4 67- 4 91/ Domain : transmembrane region #status predicted <TMR7> 

Query Match 20.0%; Score 388; DB 2; Length 584; 

Best Local Similarity 26.1%; Pred. No. 1.3e-25; 

Matches 118; Conservative 69; Mismatches 137; Indels 128; Gaps 15; 

Qy 3 P S AT P GAQMGVP P G S RE P S P VP P D YEDE FLRYLWRD YL YP KQ YEWVL I AAYVAVFWALV 62 

I I : I I : I M I I I : I : ' I • - 

Db 96 PSSTPASSSSTSTG MPV W-LIPSYSMILLFAVL 127 

Qy 63 GNTLVCIAWRNHHMRTWNYFIWLSLAD 122 

I I I I : : I I I I : I I * I : : I I : : : I : I : : I : I : I : : : : : I I III: 
Db 128 GNLLVISTLVQNRRMRTITNVFLLNLAISDMLLGVLCMPVTLVGTLLRNFIFGEFLCKLF 187 

Qy 123 P YLQAVS VSVAVLTLS FI PLDRWYAI CHPLLFKS — TARRARGS I LGI WAVS LAIMVPQA 18 0 

: I I I I : I : II I : I : I I I I I I I : I I I I II : III 
Db , 188 QFSQAASVAVSSWTLVAISCERYYAICHPLRSRSWQTISHAYKIIGFIWLGGILCMTPIA 247 

Qy 181 AWQSSSVXPE]^RTRL-FSLCHERWADDLYPKIYHSCFFIWYIAPLGLMAMAYFQIF 239 

I : I : I : I I I I I I : : : | | : : : | | I 

Db 248 VFSQLIPTSRPGYCKCREFWPDQGYELFYNILLDFLLLVLPLLVLCVAYILIT 300 

Qy 240 RKLW GRQI PGT T S ALVRNWKRP S DQL 265 

II: II: III | : | | : : : 

Db 301 RT L YVGMAKD S GRI LQQ S L P VS ATTAGGS APN PGTSSSSNCI LVXTATAVYN- EN SNNNN 359 

Qy 266 GDLEQGLSG EPQPRGRAFLA EV 287 

I : I I I I I I 

Db 360 GNS EGS AGGGSTNMATTTLTTRPTAPTVI TTTTTTTVTLAKTS S PS I RVHDAALRRSNEA 419 

Qy 288 KQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFG-MFRQASDREAVYACFTFSHWL 346 

I : : : : : I I I I : .: I I : I : I : I : I : : I : : I I : I I 

Db 420 KTLESKKRWKMLFVLVLEFFI CWT PLYVINTMWILI GPVVYEYVD- -- — YTAI S FLQLL 475 

Qy 347 VYANSAANPIIYNFLS GLPW 366 

I : : I I I I I I : : I I I I 

Db 476 AYSSSCCNPITYCFMNASFRRAFVDTFKGLPW 507 



RESULT 6 
S17783 



tachykinin receptor homolog DTKR - fruit fly (Drosophila melanogaster ) 
C; Species: Drosophila melanogaster 

C;Date: 22-Nov-1996 #sequence_revision 22-Nov-1996 #text_change 09-Jul-2004 
C;Accession: S17783 

R; Li, X.J.;, Wolfgang, W. ; Wu, Y.N.; North, R.A. ; Forte, M. 
EMBO J. 10, 3221-3229, 1991 

A; Title: Cloning, heterologous expression and developmental regulation of a 

Drosophila receptor for tachykinin-like peptides. 

A;Reference number: S17783; MUID : 92007772 ; PMID : 17 17263 

A;Accession: S17783 

A; Molecule type: mRNA 

A; Residues: 1-519 <LIX> 

A; Cross-references: ' UNIPROT : P30975 ; EMBL:X62711; NID:g8505; PIDN : CAA44595 . 1 ; 
PID:g8506 

A; Note: the sequence from Fig. 2 is inconsistent with that from Fig. 1 in 
lacking 481-Gly 
C; Genetics: 

A; Gene: FlyBase : Takr99D 

A; Cross-references : FlyBase : FBgn0004622 
C; Superfamily: neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; neurotransmitter receptor; transmembrane 
protein 

Query Match 19.9%; Score 385.5; DB 2; Length 519; 

Best Local Similarity 27.1%; Pred. No. 1.9e-25; 

Matches 105; Conservative 64; Mismatches 145; Indels . 73; Gaps 10; 



Qy 


3 


P SAT P GAQMGVP P G S RE P S PVPPDYED EFLRYLWRDYLY 

II 1 1 1 1 1 1 1 1 1:111 
PCRTLARSSPYPPVS FNHSQTLSTDQPAVGDVTSDAAEDAAASMETGSFAFWPWWRQV 


41 


Db 


45 


103 


Qy 

Db 


42 
104 


PKQYEWVLIAAWAVFVVALVGNTLVCL 

1 : : : : : 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 : 1 : : : : 
WSIL — FGGMVIVATGGNLIWWI WITTKRMRTWNYFIWLSIADAMVSSLNW 


101 
156 


Qy 


102 


ASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRA 

: : III II: : : : 1 : : 1 1 1 1 : 1 M 1 1 II : : 1 
FNYYYMLDSDWP FGEFYCKLSQFI AMLS I CASVFTLMAI S IDRYVAI I RPLQPRMSKRCN 


161 


Db 


157 


216 


Qy 


162 


RGSILGIWAVSLAIMVPQAAVMQSSSV-LPELANRTRLFSLCHERWAD DLYPKIY 

Mill : : : 1 : 1 : 1 1 1 :|: 1 1 : 1 
LAIAAVIWLASTLISCPMMIIYRTEEVPVRGLSNRT VCYPEWPDGPTNHSTMESLY 


215 


Db 


217 


272 


Qy 


216 


HSCFFIVTYLAPLGLMAMAYFQI FRKLWGRQI PGTTSALVRNWKRPSDQLGDLEQGLSGE 
: | : | | | : | : | : : : | | | : II 
NI LI 1 1 LT YFLP I VSMTVT YS RVGI E'LWGS K TIGE 


275 


Db 


273 


307 


Qy 


276 


PQPRGRAFLAEVT^QMRARRKTAKMLMWLLVF^ 

II : | : : | : : | : | | : : | | : | : | | : | : I | : : : : I : 
CTPR QVENVRSKRRWKMMIVWLIFAI CWLPFHSYFIITSCYPAITEAPFIQE 


335 


Db 


308 


361 


Qy 


336 


VYAC FT FS HWLVYAN S AAN P 1 1 YN FL S 362 
: 1 : 1 1 : 1 1 Mill::: 
LYLAI YWLAMSNSMYNPIIYCWMN 385 




Db 


362 





RESULT 7 



A34916 

neurokinin 3 receptor - rat 

N;Alternate names: neuromedin K receptor; NK-3 receptor 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 22-Jan-1993 #sequence_revision 22-Jan-1993 #text_change 09-Jul-2004 
C;Accession: A34916 

R;Shigemoto, R. ; Yokota, Y . ; Tsuchida, K.; Nakanishi, S. 
J. Biol. Chem. 265, 623-628, 1990 

A; Title: Cloning and expression of a rat neuromedin K receptor cDNA. 
A; Reference number: A34916; MUID : 90110113; PMID: 2153106 
A;Accession: A34916 
A;Molecule type: mRNA 
A; Residues: 1-452 <SHI> 

A; Cross-references: UNIPROT : P16177 ; GB:J05189; NID:g205670; PIDN : AAA4 1688 . 1 ; 
PID:g205671 

C; Superf amily : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; membrane protein 

Query Match 19.9%; Score 384.5; DB 2; Length 452; 

Best Local Similarity 28.2%; Pred. No. 2e-25; 

Matches 99; Conservative 64; Mismatches 133; Indels 55; Gaps 9; 

Qy 11 MGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVBWALVGNTLVCLA 70 

: I : I : : I I I : : : I : : II I II II I I I : I I : I 

Db 45 LGLPATTQAPSQVRANLTNQFVQPSWRIAL WSL — AYGLWAVAVFGNLIVIWI 96 

Qy 71 WRNHHMRTWNYFI WLSLADVLVTAI CLPASLLVDITESWLFGHALCKVI PYLQAVSV 130 

: : I I I I II I I : I I I :: I II : : : III I : : : I 

Db 97 ILAHKRMRTVTNYFLVNLAFSDASVAAFNTLINFIYGLHSEWYFGANYCRFQNFFPITAV 156 

Qy 131 S VAVLT L S FI P L DRW YAI CHPLLFKS T ARRARGS I L GI WAVS LAI MVP QAAVMQ S S S VL P 190 

: : : : : I : I I : I I II : : I : I I I : : : II : I : I 

Db 157 FASIYSMTAIAVDRYMAIIDPLKPRLSATATKIVIGSIWILAFLLAFPQ-CLYSKIKVMP 215 

Qy 191 ELANRTRLFSLCHERWADDLYPK IYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQI 247 

II I I : : I : II II I : I I I : I : I : I I I : I 

Db 216 GRT LCYVQWPEG — PKQHFTYHIIVIILVYCFPLLIMGVTYTIVGITLWGGEI 266 

Qy 24 8 P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQP RGRAFLAEVKQMRARRKTAKMLMWLLVF 307 

III : | :: | : | | I I : : : I : : I 

Db 267 PGDTCDKYH ; — EQLKAKRKWKMMI I WVTF 295 

Qy 308 ALCYLPI SVLNVLKRVFGMFRQAS DREAWACFT FSHWLWANSAANP 1 1 Y 358 

I : I : I I I : I : : : : I I I I I : : : I I I I I 

Db 296 AICWLPYHVYFILTAIYQQLNRWKYIQQVYLA SFWLAMSSTMYNPI IY 343 



RESULT 8 
JQ1517 

neurokinin 3 receptor - human 

N;Alternate names: neuromedin K receptor; NK-3 receptor 
C; Species: Homo sapiens (man) 

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 09-Jul-2004 
C;Accession: JQ1517; S20435; S21237 

R;Huang, R.R.C.; Cheung, A.H.; Mazina, K.E.; Strader, CD. ; Fong, T.M. 
Biochem. Biophys . Res. Commun. 184, 966-972, 1992 



A; Title: cDNA sequence and heterologous expression of the human neurokinin-3 
receptor . 

A; Reference number: JQ1517; MUID : 92246993 ; PMID: 1374246 
A;Accession: JQ1517 

A; Status: nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-465 <HUA> 

A; Cross-references: UNIPROT: P29371; GB:M89473; NID:gl89223; PIDN : AAA36366 . 1 ; 
PID:gl89224 

A; Experimental source: brain 

R;Buell, G.; Schulz, M.F.; Arkinstall, S.J.; Maury, K. ; Missotten, M. ; Adami, 
N. ; Talabot, F. ; Kawashima, E. 
FEBS Lett. 299, 90-95, 1992 

A; Title: Molecular characterisation, expression and localisation of human 
neurokinin-3 receptor. 

A; Reference number: S20435; MUID : 92183914 ; PMID: 1312036 
A;Accession: S20435 
A; Status: preliminary 
A;Molecule type: DNA 

A;Residues: 1-2, f I T , 4 — 62, 1 R ' , 64-465 <BUE> 

A;Cross-references: GB:S86392; NID:g246908; PIDN : AAB21706 . 1 ; PID:g246909 
R;Takahashi, K. ; Tanaka, A.; Hara, M. ; Nakanishi, S. 
Eur. J. Biocheirw 204, 1025-1033, 1992 

A; Title: The primary structure and gene organization of human substance P and 
neuromedin K receptors . 

A; Reference number: S21188; MUID : 92201186; PMID: 1312928 
A; Accession: S21237 
A; Status: preliminary 
A;Molecule type: DNA 

A; Residues: 1-438, 1 F 1 , 440-465 <TAK> 

A;Cross-references: GB:X65172; NID:g35022; PIDN : CAA46291 . 1 ; PID:g825695 

C; Comment: The endogenous ligand of this receptor is neurokinin 3 (neuromedin 

K) , one of the peptides in the mammalian tachykinin system. 

C; Genetics: 

A; Gene : GDB : TACR3 

A; Cross-references : GDB: 9599126 

C; Superfamily : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 
transmembrane protein 

F; 83-111/Domain : transmembrane jfstatus predicted <TM1> 
F; 122- 147 /Domain: transmembrane #status predicted <TM2> 
F; 160-181/Domain: transmembrane ftstatus predicted <TM3> 
F; 202-221/Domain : transmembrane #status predicted <TM4> 
F;247-272/Domain: transmembrane #status predicted <TM5> 
F; 3 00- 321 /Domain : transmembrane #status predicted <TM6> 
F; 333-355/Domain: transmembrane #status predicted <TM7> 

F; 23, 50, 73/Binding site: carbohydrate (Asn) (covalent) ftstatus predicted 
F; 158-233/Disulfide bonds: #status predicted 

Query Match 19.8%; Score 384; DB 1; Length 465; 

Best Local Similarity 27.9%; Pred. No. 2.3e-25; 

Matches 100; Conservative 67; Mismatches 135; Indels 56; Gaps 10; 

Qy 4 SATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVG 63 

I : : I I : I : I I II : : : I : : II I II II I ' I I : : I 

Db 52 SSSPSA-LGLPVASPAPSQPWANLTNQFVQPSWRIAL WSL— AYGVWAVAVLG 102 



Qy 64 NTLVCLAWRNHHMRTWNYFIWLSLADVTjW 123 

I : I : : I I I I I I I I : I I I : : I : I : : : III I : 

Db 103 NLIVIWIILAHKRMRTVTNYFL^ 162 

Qy 124 YLQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAI MVPQ AAVM 183 

: : I : : : : : I : M : I I II : : I : I I I : : : II : 
Db 163 FFP I TAVFAS I YSMTAI AVDRYMAI I DPLKPRLS ATAT KI VI GS I WI LAFLLAFPQ- CLY 221 

Qy 184 QSSSVLPELANRTRLFSLCHERWADDLYPK IYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

: I : I II II : I : II II I : I I I : I : I : 

Db 222 SKTKVMP GRT LCFVQWPEG — PKQHFTYHIIVIILVYCFPLLIMGITYTIVGI 272 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

111:1111 : I :: I : I I I I : 

Db 273 TLWGGEI PGDTCDKYH EQLKAKRKWKMM 301 

Qy 301 MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAAN P 1 1 Y 358 

: : I : : I I : I : I I : : I : : : : I I III : : : I I I I I 

Db 302 1 1 WMT FAI CWLP YHI YFI LTAI YQQLNRWKYI QQVYLA SFWLAMSSTMYNPIIY 356 



RESULT 9 
S55524 

neurokinin 3 receptor - mouse (fragment) 

N; Alternate names: neuromedin K receptor; NK-3 receptor 
C; Species: Mus musculus (house mouse) 

C;Date: 01-Aug-1995 #sequence_revision Ol-Sep-1995 #text__change 09-Jul-2004 
C;Accession: S55524; 173045 
R;Maroteaux, L. 

submitted to the EMBL Data Library, June 1995 
A; Reference number: S55524 
A; Accession: S 5552 4 
A;Molecule type: mRNA 
A; Residues: 1-385 <MAR> 

A; Cross-references: UNIPROT : P47937 ; EMBL : X87 823 ; NID:g861055; PIDN : CAA61088 . 1; 
PID:g861056 

R; Cook, . G.A. ; Elliott, D. ; Metwali, A.; Blum, A.M.; Sandor, M. ; Lynch, R. ; 
Weinstock, J.V. 

J. Immunol. 152, 1830-1835, 1994 

A; Title: Molecular evidence that granuloma T lymphocytes in murine 
schistosomiasis mansoni express an authentic substance P (NK-1) receptor. 
A; Reference number: 156216; MUID : 94165478 ; PMID: 8120392 
A;Accession: 173045 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 

A;Residues: 103-197, ' S 1 , 199-266, 1 P 1 , 268-328 <C\30> 

A; Cross-references: GB:L27827; NID:g450288; PIDN :AAA17 893 . 1 ; PID:g480780 
C; Superf amily: neurokinin 1 receptor 

Query Match 19.5%; Score 378.5; DB 2; Length 385; 

Best Local Similarity 28.2%; Pred. No. 5.5e-25; 

Matches 99; Conservative 63; Mismatches 134; Indels 55; Gaps 9; 

Qy 11 MGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLA 70 

:|:| hill : ::|:: II I II I I I ||: II :| 

Db 45 LGLPVTSQAPSQVRDNLTNQFVQPSWRIAL WSL — AYG LWAVAVFGN L I VI W I 96 



Qy 71 WRNHHMRTVTNYFIWLSL7VDVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSV 130 

: : I I I I I I I I : I I I : : I I I : : : III | : : ,: | 

Db 97 I LAH KRMRT VTN YFLVN LAFS DAS VAAFNT L W F I YGVH S EW Y FGAN YC RFQN F F P I T AV 156 

Qy 131 S VAVL TLSFIPLD RW Y AI CHPLLFKS T ARRARG S I L G I WAVS LAI MVP Q AAVMQ S S S VL P 190 

:: ::: I :||: II II : :| : I II :: : II : |:| 
Db 157 FASIYSMTAIAVDRYmilDPLKPRLSATATKIVIGSIWILAFLLAFPQ-CLYSKIKVMP 215 

Qy 191 ELANRTRLFSLCHERWADDLYPK 1 YHSCFFIVTYLAPLGLMAMAYFQI FRKLWGRQI 247 

II I I : : I : II II I : I I I : I : I : Ml : I 

Db 216 GRT LCYVQWPEG— PKQHFTYHIIVIILVYCFPLLIMGVTYTIVGITLWGGEI 266 

Qy 24 8 P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKT AKMLMWLLVF 307 

II : | : : | : | | I I : : : I : : I 

Db 267 LGDTCDKYH 7 EQLKAKRKWKMMIIVWTF 295 

Qy 308 ALC YL P I S VLNVLKRVFGMFRQ AS D REAVYAC FT F S HW LVYAN S AAN P 1 1 Y 358 

I : I : II I : I : : : : I I III : : : II I I I 

Db 296 AI CWLPYHVYFI LTAI YQQLNRWKYIQQVYLA SFWLAMSSTMYNPIIY 343 



RESULT 10 
JN0692 

cholecystokinin type A receptor - human 
C; Species: Homo sapiens (man) 

C;Date: 03-Feb-1994 #sequence_revision 03-Feb-1994 #text_change 09-Jul-2004 
C;Accession: JN0692; JN0590 

R;de Weerth, A.; Pisegna, J.R.; Huppi, K. ; Wank, S.A. 
Biochem. Biophys . Res. Commun. 194, 811-818, 1993 

A;Title: Molecular cloning, functional expression and chromosomal localization 

of the human cholecystokinin type A receptor. 

A; Reference number: JN0692; MUID: 93343941; PMID: 8343165 

A;Accession: JN0692 

A; Molecule type: mRNA 

A; Residues: 1-42 8 <DEW> 

A; Cross-references: UNIPROT : P32238 ; GB:L19315; NID:g306595; PIDN : AAA02819 . 1 ; 
PID:g306596 

A; Experimental source: gallbladder 

R;Ulrich, CD.; Ferber, I.; Holicky, E.; Hadac, E.; Buell, G. ; Miller, L.J. 
Biochem. Biophys. Res. Commun. 193, 204-211, 1993 

A; Title: Molecular cloning and functional expression of the human gallbladder 
cholecystokinin A receptor. 

A; Reference number: JN0590; MUID : 93277552 ; PMID: 8503909 
A; Accession: JN0590 

A; Status: nucleic acid sequence not shown 
A; Molecule type: DNA ; ' "" 

A; Residues: 1-428 <ULR> 

A; Cross-references: GB:L13605; NID:g306490; PIDN : AAA35659 . 1 ; PID:g306491 
A; Experimental source: gallbladder 

C; Comment: This protein has diverse physiological roles in the gastrointestinal 

system where it mediates pancreatic growth and enzyme secretion, smooth muscle 

contraction of the gallbladder and stomach, and secretion from gastric mucosal 

cells. 

C; Genetics: 

A; Gene: GDB : CCKAR 

A;Cross-references : GDB: 141927; OMIM: 118444 
A;Map position: 4pter-4qter 



C;Superfamily: neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 
phosphoprotein; transmembrane protein 

F; 4 0-67 /Domain : transmembrane #status predicted <TM1> 
F; 7 8- 104 /Domain: transmembrane #status predicted <TM2> 
F; 116- 137 /Domain : transmembrane #status predicted <TM3> 
F; 158-17 8 /Domain: transmembrane #status predicted <TM4> 
F;208-234/Domain: transmembrane #status predicted <TM5> 
F; 314-332/Domain: transmembrane #status predicted <TM6> 
F; 350-369/Domain: transmembrane #status predicted <TM7> 

F; 10, 24 , 190, 299/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;245, 249,260, 290/Binding site: phosphate (Ser) (covalent) (by protein kinase C) 
#status predicted 

F;256/Binding site: phosphate (Ser) (covalent) (by protein kinase A) #status 
predicted 

Query Match 19.3%; Score 374.5; DB 2; Length 428; 

Best Local Similarity 28.3%; Pred. No. 1.4e-24; 

Matches 106; Conservative 76; Mismatches 155; Indels 37; Gaps 10; 



Qy 


16 


GSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFWALVGNTLVCLAVW 

II 1 1 : 1 1 1 1 1 : II II 1 : | : : : : : M 1 1 1 : 
GSNITPPCELGLENETLFCL — DQPRPSK-EWQPAVQILLYSLIFLLSVLGNTLVITVLI 


72 


Db 


11 


67 


Qy 


73 


RNHHMRTWNYFIWLSl^VLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSV 
II | | | | | | | : : : | : : : | : : : I : I : I :::::: | | | : | | | I I I I 
RNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPNLLKDFIFGSAVCKTTTYFMGT SVSV 


132 


Db 


68 


127 


Qy 


133 


AVLTLSFIPLDRWYAICHPLLFK — S TARRARG S I LG I WAVS LAI MVP QAAVMQ S S S VL P 

: 1 1 1 : 1 : 1 1 1 1 1 : 1 1 1 1 : 1 III 1 : : : 1 

STFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLVP 


190 


Db 


128 


184 


Qy 


191 


ELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLW 


243 


Db 


185 


1 : ::| :|: : :|: :: :| 1 :| :|| 1 :|: 
FTKNNNQTANMCRFLLPNDVMQQSWHTFLLLILFLIPGIVMMVAYGLISLELYQGIKFEA 


244 


Qy 


244 


GRQIPGTTSA LVRNWKRPSDQLGDLEQGLSGEPQPRGR-AFLAEVK 

: 1 1 1 1 : 1 1 : 1 : 1 1 : 1 1 : 
SQKKSAKERKPSTTSSGKYEDSDGCYLQKTRPPRKL-ELRQLSTGSSSRANRIRSNSSAA 


288 


Db 


245 


303 


Qy 


289 


QMRARRKT AKMLMWLLVFALC YL P I S VLNVL KRVFGMFRQAS DREAVYAC FT FS HWLVY 
: 1 : : : : 1 1 : 1 : : : : 1 1 1 : : 1 1 1 : : : : 1 : 1 II 
NLMAKKRVI RMLI VI WLFFLCWMP I FSANAWR AYDTAS AERRLS GT P I S FI LLLSY 


348 


Db 


304 


360 


Qy 


349 


ANSAANPIIYNFLS 362 

: 1. Tl INI:: < - 
TSSCVNPIIYCFMN 374 




Db 


361 





RESULT 11 
JC2459 

gastrin/cholecystokinin B receptor - rabbit 

C; Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 21-Feb-1995 #sequence_revision 05-Apr~199.5 #text_change 09-Jul-2004 
C;Accession: JC2459 

R;Blandizzi, C. ; Song, I.; Yamada, T. 

Biochem. Biophys. Res. Commun. 202, 947-953, 1994 



A;Title: Molecular cloning and structural analysis of the rabbit gastrin/CCKB 
receptor gene. 

A; Reference number: JC2459; MUID: 94324990 ; PMID: 8048969 
A;Accession: JC2459 
A; Molecule type: mRNA 
A; Residues: 1-452 <BLA> 

A;Cross-references: UNIPROT : P46627 ; GB:L31548; NID:g495663; PIDN : AAA31194 . 1 ; 

PID:g495665 

C; Genetics: 

A;Introns: 49/1; 133/1; 216/2; 273/1 

C; Super family : neurokinin 1 receptor 

C; Keywords: receptor; transmembrane protein 

F; 5 6-7 9 /Domain : transmembrane ((status predicted <TM1> 

F; 85-104/Domain: transmembrane #status predicted <TM2> 

F; 130-149/Domain: transmembrane ftstatus predicted <TM3> 

F; 169-187/Domain: transmembrane #status predicted <TM4> 

F; 2 17-2 37 /Domain : transmembrane #status predicted <TM5> 

F; 33 9- 35 9 /Domain : transmembrane #status predicted <TM6> 

F; 3 8 1-4 00/ Domain : transmembrane #status predicted <TM7> 

Query Match 19.3%; Score 373.5; DB 2; Length 452; 

Best Local Similarity 27.0%; Pred. No. 1.8e-24; 

Matches 99; Conservative 69; Mismatches 124; Indels 75; Gaps 9; 

Qy 48 VLIAAWAVFWALVGNTLVCLAWRN^ 107 

: : I : | : : : : | | | : : : : : I I I I I I : : : I : : : I : I : I : I : I I : 
Db 54 I RVTL YAVI FLMS VGGN I L 1 1 WLGLS RRLRTVTNAFLLS LAVS DLLLAVACMP FTLLPN 113 

Qy 108 I T E S WL FGHALCKVT P YLQAVS VS VAVLT L S FI P LDRW YAI CH P LL FK — STARRARGSI 165 

: : : : I I : I I : I I I I I I I : I : I I I : I : I I I I I : I I I 

Db 114 LMGTFIFGTVICKAVSYLMGVSVSVSTLSLVAIALERYSAICRPLQARVWQTRSHAARVI 173 

Qy 166 LGIWAVSLAIMVPQAAVMQSSSVXPELANRTRLF 225 

I I : I : I I I II: III : : : : : 

Db 174 LAT WL L S G L LMVP Y P VYT AVQ P VG P RVLQ CVHRWPSARVRQTWSVLLLLLLFF 226 

Qy 226 APLGLMAMAYFQI FRKLW GRQI PGTTSALVRNWKRP 261 

I : I I : I I I I : I : I I I I 
Db 227 VPGWMAVAYGLISRELYLGLRFDSDSDSESQSRVRGQGGLPGGAAPG P 275 

Qy 262 SDQLGDL— EQGLSGE PQPRGRAFL AEVKQMRARR 294 

II 111:11 Mil I : I : I : : 

Db 276 VHQNGRCRPEAGLAGEDGDGCYVQLPRSRPALELSALTAPISGPGPGPRPAQAK-LLAKK 334 

Qy 295 KT AKMLMWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN 354 

: : | | : | : : : : | : I : | | : I : I : I I Mill:: I 

Db 335 RWRMLLVIWLFFMCWLPVYSANTWRAFDG PGAHRALSGAPISFIHLLSYASACVN 391 

Qy 355 PIIYNFL 361 

I : : I I : 

Db 392 PLVYCFM 398 



RESULT 12 
JQ1614 

gastrin receptor - multimammate rat (Mastomys natalensis) 
C; Species: Mastomys natalensis 



C;Date: 17-Apr-1993 #sequence_revision 17-Apr-1993 #text_change 09-Jul-2004 
C; Accession: JQ1614 

R;Nakata, H.; Matsui, T . ; Ito, M. ; Taniguchi, T.; Naribayashi, Y . ; Arima, N.; 
Nakamura, A.; Kinoshita, Y. ; Chihara, K. ; Hosoda, S.; Chiba, T. 
Biochem. Biophys. Res. Commun. 187, 1151-1157, 1992 

A; Title: Cloning and characterization of gastrin receptor from ECL carcinoid 
tumor of Mastomys natalensis . 

A; Reference number: JQ1614; MUID: 92412082 ; PMID: 1530611 
A; Accession: JQ1614 
A; Molecule type: mRNA 
A; Residues: 1-450 <NAK> 

A; Cross-references: UNIPROT : P30796; GB:D12817; NID:g220646; PIDN : BAA02250 . 1 ; 
PID:g220647 

C; Superf amily : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; glycoprotein; hormone receptor; 
transmembrane protein 

F; 5 9- 8 3/ Domain : transmembrane #status predicted <TM1> 
F; 87-109/Domain : transmembrane ((status predicted <TM2> 
F; 132-150/Domain: transmembrane #status predicted <TM3> 
F; 172-188/Domain: transmembrane #status predicted <TM4> 
F;216-243/Domain: transmembrane #status predicted <TM5> 
F; 334-357/Domain: transmembrane ((status predicted <TM6> 
F; 380-398/Domain: transmembrane #status predicted <TM7> 

F; 7, 30, 36/Binding site: carbohydrate (Asn) (covalent) ((status predicted 

Query Match 19.2%; Score 371.5; DB 2; Length 450; 

Best Local Similarity 28.8%; Pred. No. 2.6e-24; 

Matches 101; Conservative 67; Mismatches 136; Indels 47; Gaps 9; 

Qy 4 8 VL I AAYVAVFWAL VGNT L VC LAW RNH HMRT VTN Y F I VN L S LAD VL VT AI C L PAS L LVD 107 

: I I : I : : : : I I I : : : : : I I I I I I : : : I : : : I : I : I : | : I | : 
Db 56 IRITLYAVIFLMSIGGNMLIIWLGLSRRLRTVTNAFLLSLAVSDLLLAVACMPFTLLPN 115 

Qy 108 I TESWLFGHALCKVI P YLQAVSVSVAVLTLS FI PLDRWYAI CHPLLFK- - STARRARGS I 165 

: : : : I I : I I : I I I I I I I : I I I I : I : I I I I I : I I I 
Db 116 LMGTFI FGTVI CKAVSYLMGVSVSVSTLNLVAIALERYSAICRPLQARVWQTRSHAARVI 175 

Qy 166 LGIWAVSLAIMVPQAAWIQSSSVXPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYL 225 

I I : I : I I I* II: III : : : : : 

Db 176 LATWLLSGLLMVPYPVYTWQPVGPRVLQ CMH RW P S ARVRQT W S VL L LML L F F 228 

Qy 226 APLGLMAMAYFQIFRKLW-GRQIPG TTSALVRNW KRPSDQLGDLEQ-G 271 

I : I I : I I I I : I : I : I I : I I I I I I 

Db 22 9 IPGVVMAVAYGLISRELYLGLRFDGDNDSDTQSRVRNQGGLPGGTAPGPVHQNGGCRHVT 288 

Qy 272 LSGEPQ PRGRAFL — AEVKQMRARRKTAKMLMWLLVFALC 310 

: : I I III : I : : I : : : : I I : I : : I : I I I 

Db 289 VAGEDNDGCYVQLPRSRLEMTTLTTPTPGPGLASANQAKLLAKKRWRMLLVIVLLFFLC 348 

Qy 311 YLP I S VTiNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANS AANP 1 1 YN FL 361 

: I I I I I : I I : | I | | | : : I I : : I I : 

Db 349 WLPIYSANTWCAFDG PGAHRALSGAPISFIHLLSYASACVNPLVYCFM 396 



RESULT 13 
159336 

galanin receptor 1 - human 



C; Species: Homo sapiens (man) 

C;Date: 31-May-1996 #sequence_revision 31-May-1996 #text_change 09-Jul-2004 
C;Accession: 159336; JC5801; G01765; G02528 

R;Habert-Ortoli, E . ; Amiranoff, B.; Loquet, I.; Laburthe, M. ; Mayaux, J. 
Proc. Natl. Acaci. Sci. U.S.A. 91, 9780-9783, 1994 

A;Title: Molecular cloning of a functional human galanin receptor. 
A;Reference number: 159336; MUID: 95024044 ; PMID:7524088 
A;Accession: 159336 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-349 <HAB> 

A;Cross-references : UNIPROT : P47211; GB:L34339; NID:g559047; PIDN : AAA50767 . 1 ; 
PID:g559048 

R;Lorimer, D.D.; Matkowskj, K. ; Benya, R.V. 
Biochem. Biophys . Res. Commun. 241, 558-564, 1997 

A; Title: Cloning, chromosomal location, and transcriptional regulation of the 
human galanin-1 receptor gene (GALN1R) . 

A; Reference number: JC5801; MUID : 98 08*6390 ; PMID: 9425310 
A;Accession: JC5801 

A; Status: nucleic acid sequence not shown ; 
A;Molecule type: mRNA 
A; Residues: 1-349 <LOR> 

A;Cross-references: GB:U53511; NID: gl297337 ; PIDN : AAC51936 . 1; PID:gl297338 
A;Note: submitted to the EMBL Data Library, April 1996 
R;Ross, P.C. 

submitted to the EMBL Data Library, March 1995 
A; Reference number: G08350 
A;Accession: G01765 

A; Status: translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 1-14 , ' W 1 , 16-34 9 <ROS> 

A;Cross-references: EMBL:U23854; NID:g775209; PID:g775210 

C; Comment: This receptor inhibits cAMP formation, stimulates and inhibits 

phospholipose C activity, decreases phorbol ester-induced protein 

phosphorylation by a protein kinase C-independent mechanism, and increases 

arachadonic acid metabolism, as well as opens ATP-dependent K+ but closes N-type 

Ca2 + channels. 

C; Genetics: 

A; Gene: GDB : GALNR 

A; Cross-references : GDB: 392699; OMIM: 600377 

A; Map position: 18q23-18q23 

C; Superfamily : vertebrate rhodopsin 

Query Match 19.2%; Score 371; DB 2; Length 349; 

Best Local Similarity 29.9%; Pred. No. 2.2e-24; 

Matches 97; Conservative - '62; Mismatches 107; Indels 58; Gaps 10; 

Qy 47 WVLIAAYVAVFWALVGNTLVCLAVWRNH — HMRTVTNYFIVNLSLADVLVTAICLPASL 104 

: I : : : I : : : I I : I I : I : I : I I I I : I I I : I I : I : I 

Db 34 FVTLWFGLI FALGVLGNSLVITVLARSKPGKPRSTTNLFILNLS IADLAYLLFCI PFQA 93 

Qy 105 LVT)ITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGS 164 

I. : I : I : I i I I I I : I : : I I : : : I I : I I I I : I : I : 

Db 94 T VYAL P T WVL GAF ICKFIHYF FT VSML VS I FT LAAMS VD RYVAI VH S RR- S S S LRVS RN A 152 



QY 



165 ILG IWAVSLAIMVPQAAVlyiQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFI 221 

: I I I I I : I : I : II I : I : I I hi I : I I I I : 



Db 



153 LLGVGCI WALS IAMAS PVA- - YHQGLFHPRASNQT FCW EQW P D P RH KKAYWCT FV 2 06 



Qy 222 VTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGR 281 

II II I: I :: I : ::| : |: 
Db 207 FGYLLPLLLICFCYAKVLNHLHKK LKNMSKKSE 239 

Qy 282 AFLAEVKQMRARRKTAKMLMWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYACFT 341 

: : : I I I : : : I I : : I I : : I I : : : I I I : I II 
Db 240 AS K KKT AQT VLWWVFG ISWLPHHIIH- LWAE FGVF — PLTPASFL 283 

Qy 342 F SHWLVYANSAANPIIYNFLS 362 

I : I I I : I I : I I I I I I I I 
Db 284 FRITAHCLAYSNSSVNPIIYAFLS 307 



RESULT 14 
151898 

cholecystokinin A receptor - guinea pig 
C; Species: Cavia porcellus (guinea pig) 

C;Date: 04-Sep-1997 #sequence_revision 04-S.ep-1997 #text_change 09-Jul-2004 
C; Accession: 151898 

R; De Weerth, A.; Pisegna, J.R.; Wank, S.A. 
Am. J. Physiol. 265 f G1116-G1121, 1993 

A; Title: Guinea pig gallbladder and pancreas possess identical CCK-A receptor 

subtypes: receptor cloning and expression. 

A;Reference number: 151898; MUID: 94106629; PMID:7916580 

A;Accession: 151898. 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-430 <RES> 

A; Cross-references: UNIPROT : Q63931 ; GB:S68242; NID:g544723; PIDN : AAB29504 . 1 ; 
PID:g544724 

C; Superfamily: neurokinin 1 receptor 

Query Match 19.1%; Score 370.5; DB 2; Length 430; 

Best Local Similarity 27.4%; Pred. No. 3e-24; 

Matches ,104; Conservative 77; Mismatches 152; Indels 47; Gaps 10; 

Qy 16 GS RE P S PVP P D YE DE FLRYLWRD YL Y P K- Q YEW VL I AAYVAVFWALVGNT LVC LAV 71 

II I : I : I I II I : II II I : I :: : : : I I I I I : 
Db 11 GSNITSACELGFENETLFCLDR PRPSKEWQPAVQILLYSLIFLLSVLGNTLVITVL 66 

Qy 72 WRNHHMRT VTN Y F I VN LS LADVLVT AI CL P AS LLVD I T E S WL FGHALC KVI P YLQAVS VS 131 

II I I I I I I I :: : I : : : I :: : I : I : I : : : :: I I I : I I I III 
Db 67 IRNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPSLLKDFIFGSAVCKTTTYFMGTSVS 126 

Qy 132 VAVLTLS FI PLDRWYAICHPLLFK — STARRARGS I LGIWAVSLAIMVPQAAVMQS S SVL 189 

I : I I I : I : I I I I I : I I I I : I III I : : : 

Db 127 VSTFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLV 183 

Qy 190 PELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLW-GRQIP 248 



Db 184 PFTKNNNQTGNMCRFLLPNDVMQQTWHTFLLLILFLIPGIVMMVAYGLISLELYQGIKFD 243 

Qy 249 GTTSALVRNWKRPSDQLGDLEQG LSGEPQPRGRAFLAEVKQ 289 

: I : I : I I II I :: I 

Db 244 AIQKKSAKERKTSTGSSGPMEDSDGCYLQKSRHPR KLELRQLSPSSSGSNRINRIR 299 



Qy 2 90 MRARRKT AKMLMWLLVFALC YL P I S VLNVL KRVFGMFRQAS DREAVYAC FT F 342 

: I : : : : I I : I : : : : I I I : : I I I : : : : : | : I 

Db 300 S S S S TAN LMAKKRVT RML I VI WL FFLCWMP I FS ANAWRAYDT V SAERHLSGTPISF 356 

Qy 343 SHWLVYANSAANPIIYNFLS 362 

I I : I I I I I I I : : 
Db - 357 ILLLSYTSSCVN-PIIYCFMN 376 



RESULT 15 
A44081 

kappa-type opioid receptor - human 
C; Species: Homo sapiens (man) 

C;Date: 27-Jun-1994 #sequence_revision 27-Jun-1994 #text_change 09-Jul-2004 
C; Accession: A44 081 

R;Xie, G.X.; Miyajima, A.; Goldstein, A. 

Proc. Natl. Acad. Sci. U.S.A. 89, 4124-4128, 1992 

A; Title: Expression cloning of cDNA encoding a seven-helix receptor from human 

placenta with affinity for opioid ligands . 

A;Reference number: A44081; MUID: 92237319 ; PMID: 1315051 

A; Access ion: A44 081 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-440 <XIE> 

A; Cross-references: UNIPROT : P30098 ; GB:M84605; NID:gl89391; PIDN: AAA36395 . 1 ; 
PID:gl89392 

C; Super family : neurokinin 1 receptor 

C; Keywords: G protein-coupled receptor; transmembrane protein 

Query Match . 19.1%; Score 370.5; DB 2; Length 440; 

Best Local Similarity 28.3%; Pred. No. 3.1e-24; 

Matches 106; Conservative 59; Mismatches 138; Indels 72; Gaps 12; 

Qy 4 SAT P GAQMGVP P GSREPSPVP PDYEDEFLRYLWRDYLYPKQ 4 4 

I I I I III : I I I I llhl I 
Db 9 SAWPG— WGWPPPAALRNLTSSPAPTASPSPAPSWTPSPRPGPAHPFLQPPWAVAL 62 

Qy 45 YEWVL I AAWAVFVVALVGNT LVC IAVW RNH HMRT VTN YF I VNL S LAD VLVTAI C L PAS L 104 

II I I I I I I :: I I : I I : I I I I I I I : I I I : I I : I : : 
Db 63 — WS L — AYGAWAVAVXGNL VVIWI VLAHKRMRTVTNS FLW 118 

Qy 105 LVT)ITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKST7VRRARGS 164 

: : III I : : : I : : : : : I : I I : I I II : : I I 
Db 119 IYALHGEWYFGANYCRFQNFFPITAVFASIYSMTAIAVDRYMAIIDPLKPRLSATATRIV 178 

Qy 165 ILGIWAVSLAIMVPQAAVMQSSSVLPELANRTRLFSLCHERWADDLYPKI-YHSCFFIW 223 

Ml::: II. : hi II I I : " : I : II : : 

Db 179 IGSIWILAFLLAFPQ-CLYSKIKVMP — -GRT LCYVQWPEGSRQHFTYHMIVIVLV 2 30 

Qy 224 YIAPLGLMAMAYFQIFRKLWGRQIPGTTSALVT^NWKRPSDQLGDLEQGLSGEPQPRGRAF 283 

I I I : I : I : I I I : I I I I II 
Db 231 YCFPLLIMGITYTIVGITLWGGEI PGDTC DKYQ 263 

Qy 284 LAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFS 343 

: I : : I : I I I I : : : I : : I I : I : I I : : I : : : : I I I 
Db 264 EQLKAKRKWKMMI I VWT FAI CWLP YHI YFI LTAI YQQLNRWKYI QQVYLA S 316 



Qy 344 HWLVYANSAANPIIY 358 

II : : : I I I I I 
Db 317 FWLAMSSTMYNPIIY 331 



Search completed: October 14, 2004, 10:56:56 
Job time : 21.6209 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 14, 2004, 10:55:56 ; Search time 68.6732 Seconds 

(without alignments) 
1737.280 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-070-532-4 
1937 

1 ME P SAT P GAQMG VP P G S RE P 



NSAANPIIYNFLSGLPWSLL 369 



Scoring table: 



BLOSUM62 • 1 

Gapop 10.0 , Gapext 0.5 



Searched: 



1360919 seqs, 323318874 residues 



Total number of hits satisfying chosen parameters: 



1360919 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



Published_Applications_AA: * 



1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:* 

3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep: * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:* 

7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep: * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2__6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:* 
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:* 
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:* 
13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 
14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:* 
15: / cgn2_6/p toda ta/ 2 /pubpaa/US 1 0C_PUBCOMB . pep : * 
16: / cgnz_6/ptodata/ 2/pubpaa/US10D PUBCOMB . pep : * 
17 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:* 
18 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:* 
19: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. Score Match Length DB ID 



Description 





1937 


100 . 


o 


369 


10 


US_09-393-696-4 


Sequence 4, Appli 


9 


1909 


98 . 




369 


13 


Us- 10- 07 7-8 7 4- 4 


Sf^nuf^npp 4 Ann! i 


3 


1875 


96 . 


3 


389 


9 


US-09-828-538-20 

VJO \J J U VJ yj *J VJ \J 


SpfTiipnrp 20 Add! 


4 


X vj / O 


96 . 


8 


389 


10 


US-09-211-82 3C-23 

vJ O Z.XX Z.O 


^pnuonrp 23 Annl 

►J v_-« vJ \JL \j 1 1 Z_ ^ iiU _l_ 


s 

•J 


1875 


96 . 


8 


425 


9 


TTa_nq-R9R-S38-1Q 

\JkJ W -J \J C* VJ *J VJ 


S^rniRn ce 1 9 Add 1 


U 


1 R7S 

J. o / J 


27 U • 


o 


4 9 5 


9 


TI < ^-0Q-R9R-5' : ift-94 

UO U J U£. u ooo z.*± 


q /^rn i (^Tt c£> 24 Add 1 


7 


1 R7S 

X O / O 


_? \j • 


ft 

L> 


4 9 S 

L ± z. o 


1 0 


Tjq-OQ-91 1-R9^P-99 


^prniprirp 99 Ann 1 


ft 
O 


1 R7S 




ft 

o 


4 9 S 


1 4 


itc-I 0-99 S-S67A-^f;R 

UO XW Z. Z. -J OU / o u o 


Qprri i pn r*p 9 ^ Q Am^ 

OCmiCll^C O \J (J ^ .TVkv>k7 


Q 

27 


1 ft 7 S 


27 o • 


ft 
D 


4 9 S 
4 Z o 


1 4 

X 4 


itc_i 0-^S9-fiR4 A-99 

Uo XU JJi OOln 


c f-r 1 1 pn r*p 99 Am^l 
O tr tJU.t;Xiv— t: z. , /-\.L^|JX 


i n 
x u 


1 ft74 
lo / 4 


27U . 


7 


4 n? 

4 uz 


1 n 

X u 


uo u 17 o y o \jy\j z. 


Qpni ion r*p 9 Aio'Pi 1 i 

OcljUcIlLC ^ .rA-^J O X X 


1 1 


1 R79 


y u . 


D 


^79 
o / z. 


1 0 


UO UZ7 J U-7U U 


Qprfiipnrp f\ Ar^Til H 
oc^ucm.'C \j j r\^J x x 


1 9 
XZ 


1 R79 
J. o / z 




D 


^79 
o / z. 


_L O 


TT^-I 0— 077-R74-fi- 

UO XU U// 0/4 u 


Q <n /~f 1 ion r"P fci Qr>rN 1 -i 

o tr\_ju triioc; u^ rvppii 


1 9 


i ft *7 n 
x o / u 




c. 
o 


4 9 R 
4 Z o 


i n 
x u 


TTQ_nQ — ft9^ — RfiQ— R4Q 
Uo U -7 OZD o U o4^7 


oecjuerice o4 27 f -^pp 


1 4 
1 4 


1 ft £Q 


2? D . 


D 


4 D9 
4 UZ 


1 9 

X o 


ttq_1 0-077-R74-9 
UO XU U// o/4 z. 


Q o /ti i on r*p 9 Zir*ri 1 i 
O fc; Utlllt^t; r rtppil 


1 S 


1 ft £Q 


Q £ 
JO. 


D 


4 9 S 


Q 


TTc_nQ — — ft 4 ft — 9 

Uo Z70X 040 Z 


^ r*t 1 i ^ i~\ 9 7\ >^\y>i 1 ■? 

oequtrllL-c Z ^ Appil 


J- o 


i / jj 




D 


zl 9 7 
4Z / 


Q 


TTc;_nQ — 790 — — 9 
Uo U 7 jU joX Z 


oecjuence z^ Kppii 


1 "7 


i oof; c 


£ft 


c; 
O 


4 OU 


1 A 
X 4 


ttQ— 1 0— Oft 1 — ft 1 0 — 4^ 
Uo XU UOX OXU 4 D 


oecjueuce 4u, /\ppx 


1 Q 

xo 


lolo . 0 


K) 1 . 


p 

o 


/I /I /l 
4 4 4 


X u 


TTQ— HQ — QQ9_ _1 Q 

Uouyyyzooiiy 


oecjuence X27 A /\ppx 


1 Q 


1919 c; 


o / . 


O 


4 4 il 
4 4 4 


1 A 
X 4 


TTC!_.1 0 — Oft 1 — ft 1 0 — 4 R 
Uo 1U U01 OIU 4o 


jcquence 4o f /\ppx 


9 n 
Z u 


1 Jl J ■ o 


£7 
D / . 


p 
0 


4 4 4 
4 4 4 


1 4 
X 4 


ttc 1 f)— 99 R— SfilA- ^70 

Uo XU oO/.r\. O/U 


O c U. fc: 1 1 O/U/ /-vpp 


9 1 
Z X 


lolo - o 


67. 


8 


AAA 
4 4 4 


1 A 
X 4 


TTQ— 1 0—9(^9 — ^1 9—1 Q 
Uo XU Z OZ oXo 17 


oe(Lju.ence X2?^ /\ppx 


ZZ 


i q n o c: 
i JlJ . D 


67. 


8 


/I /t A 

4 44 


1 A 

1 4 


ttc in n^n 9^q ii 
Uo— 1U~ UbU— oby— 11 


Sequence 11 f Appl 


9 9 


1 Ol Q C 

lolo . o 


67. 


8 


/i /i /I 
4 4 4 


1 4 
1 4 


TTQ — 1 0—1 7ft — 1 Q4 — 9 
Uo 1U 1 to 1:74 Z 


oecjuence z, /\ppxx 


9 Zl 
Z 4 


1 91 9 S 
lolo . o 


67. 


8 


Zl zl 
4 4 4 


1 £ 
X o 


TTQ— 1 0 — 76ft — R7ft — 1 Q 
Uo XU /OO O/O Xj/ 


oecjuence x-7^ /\ppx 


9 S 
Z O 


1919 c 
XoXZ • D 


67. 


8 


AAA 
4 4 4 


1 Zl 
X 4 


TTQ— 1 0 — 9ft9 — 71 7 — 9 
UoXUZOZ / X / Z 


oecjuence z, /\ppxx 


9 £ 
Z D 


1 90R S 
IjUO . O 


67. 


6 


4 4 4 
141 


i n 

X u 


TTQ-0Q — R9 6-S0Q— SSI 

UO U 27 OZ.D O U 27 OOX 


Qnrfiipnpp SSI Av\n 
OcLjUGiIv^t; OOX^ YT.kyk7 


9 7 
Z / 


iz / y 


66. 


0 


Z Do 


1 Zl 
X 4 


TTQ— 1 0 — 97 ft — Oft 7a— S 4 
Uo XU Z /o UO / J\ o4 


oecjuence o4 f /\ppx 


9 R 
Z o 


A ft 9 S 
4 0 o . 3 


25.0 


zi 

4 oU 


Q 


TTQ — OQ — ft<^^ — 9Zlfta— ft 
Uo U .7 ODD Z40/\ O 


oecjuence O/ /\ppxx 


9 Q 
Z 27 


4 ft 9 S 
4 O o . o 


25. 


0 


4 1 n 

4 o U 


1 zl 

X 4 


TTQ—1 0 — 99S— ^f^l'h—^^P. 
Uo XU ZZJ oD/r\ DoO 


O Li LJ Li 1 fcj DJO, 7A.pp 


90 


4 O O ■ O 


25. 


0 


4 

*± o u 




ttq_ i 0-71 Q-SR7-S4 

UO XU / X 27 OO/ Jl 


Qpimipm^^ S4 Atm^I 
O ClLJU.t^ill_t: 04 f Appi 


91 


4 ft "3 S 
4 0 o . o 


25. 


0 


4 41 
4 4 1 


1 4 
X 4 


TTQ— 1 0— 9Q9-7QR-RQ0 

Uo XU Z-?Z / 27 O 0 27U 


OciCJUfctllCfcr 027U/ App 


99 
JZ 


Zl ft n S 


24. 


8 


4^9 
4 oZ 


q 


TTQ — 0Q — ft^f^— 94ftZ\— 9 
Uo U27 0DD Z4 OZA Z 


oecjuence z ^ /\ppxx 


9 9 
JJ 


/i ft n ^ 

4 0 U . o 


24. 


8 


A ^9 
4 oZ 


x o 


TTQ—1 O — 71 Q— Sft7 — 97 
Uo XU /X27 JO / J / 


oecjuence o / 1 /\ppx 


94 
O 4 


47Q S 


24. 


8 


4 9 R 
4 Z O 


q 


TTQ-0Q — 9Q9-Q7^-4 

UO U27 Z.27Z. 27/0 4 


^pmipn OP 4 Ar\r> "1 -i 

o t;tju.t;xiot: 4 / rippil 


9S 

O J 


47 9 S 
4 / O . o 


24. 


4 


4 9 0 


q 


UO U27 OUU Z-H O ./A U 


Q p /^f i lpripp 7\>-\r-\ 1 i 

ocLjutriiv^tr U/ rtppi J. 


9£ 
o D 


47 9 S 

4 / O . O 


24. 


4 


4 90 


1 4 

X 4 


TTQ—1 0-OfiO-^fiQ-Q 

UO XU UUU O U 27 27 


Qprfiipnr'P Q Bnn 1 H 
O t: vJ U ti I J. C- C 27/ rT.kJ|J X X 


o f 


/4 7 9 S 

4 / o . o 


24. 


4 


R99 
oZZ 


1 4 
X 4 


TTQ—1 0— Oft 1 — ft 1 0— S9 
Uo XU UOX OXU OO 


oecjuence oo^ /\ppx 


^R 
o o 


473 5 
" / o • o 


24. 


4 


S99 


1 4 

X 1 


ttq — 1 0-9 2 5- 5 67 A- SI 2 

UO X\J i il. J — ' W / xv O X £^ 


Q <=»rri i <^r> r*f=» SI 9 Ann 


39 


473.5 


24. 


4 


522 


14 


US-10-255-551-2 


Sequence 2 f Appli 


40 


473.5 


24. 


4 


522 


15 


US-10-072-012-360 


Sequence 360, App 


41 


473.5 


24. 


4 


522 


15 


US-10-072-012-361 


Sequence 361, App 


42 


473.5 


24. 


4 


522 


15 


US-10-276-774-2093 


Sequence 2093, Ap 


43 


473.5 


24. 


4 


522 


17 


US-10-757-262-8 


Sequence 8, Appli 


44 


467.5 


24. 


1 


417 


15 


US-10-072-012-358 


Sequence 358, App 


45 


453 


23. 


4 


426 


9 


US-09-292-973-19 


Sequence 19, Appl 



ALIGNMENTS 



RESULT 1 
US-09-393-696-4 

; Sequence A, Application US/09393696 
; Publication No. US20030022277A1 
; GENERAL INFORMATION: 



; APPLICANT: Human Genome Sciences , Inc. et al . 

TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/09/393, 696 

; CURRENT FILING DATE: 1999-09-10 

; EARLIER APPLICATION NUMBER: PCT/US95/05616 

; EARLIER FILING DATE: 1995-05-05 

; EARLIER APPLICATION NUMBER: US08/462,509 

; EARLIER FILING DATE: 1995-06-05 

; NUMBER OF SEQ ID NOS: 23 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 4 

LENGTH: 369 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-393-696-4 

Query Match 100.0%; Score 1937; DB 10; Length 369; 

Best Local Similarity 100.0%; Pred. No. 1.7e-172; 

Matches 369; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP P D YEDE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ME P S AT P GAQMGVP PGS REP SPVPPDYEDE FLRYLWRD YLYPKQ YEWVL I AAYVAVFWA 60 



Qy 61 LVGNTLVCJ^WRNHHMRTVTNYFIWLSrJ^VLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

I I I I i I I I I I I I I I I I I I LI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVTVIQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7VRRKTAKML 300 

Qy 301 MWLLVFALC Y L P I S VLNVL K RVFGMFRQAS D REAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 301 MWLLVFALC YLP I S VLNVL KRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 



.Qy 361 LSGLPWSLL 369 

I I I I I I I I I ♦ 
Db 361 LSGLPWSLL 369 



RESULT 2 
US-10-077-874-4 

Sequence 4, Application US/10077874 
Publication No. US20020115155A1 
GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 



NUMBER OF SEQUENCES : 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences , Inc. 
STREET: 9410 Key West Avenue 
; CITY: Rockville 

STATE: MD 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/077 , 87 4 
FILING DATE: 20-Feb-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
FILING DATE: 05-JUNE-1995 
; ATTORNEY/ AGENT INFORMATION: 

NAME: Wales, Michele M. 
; REGISTRATION NUMBER: 43,975 

REFERENCE/ DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 369 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
US-10-077-874-4 

Query Match 98.6%; Score 1909; DB 13; Length 369; 

Best Local Similarity 98.6%; Pred. No. 7.1e-170; 

Matches 364; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 
Qy 1 ME P SAT PGAQMGVP P G S RE P S P VP PD YEDE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWA 60 



Db 




Qy 



61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



I I I II I I I I I I I I I I I I I I I HI I I I I I I I I I I I I IN I I I I I I I I I I I I I I I I I I I I I I 



Db 



61 L VGNT L VC LAVWRNH HMRT VTN Y F I VN L S LAD VLVT AI CL PAS L LVD I T E S W L FGHAL C K 12 0 



Qy 



121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 





Db 



121 VIPYLQAVSVSVAVLTLSFI7VLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 



Db 



181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 AVKECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



Qy 



241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWL L VFAL C YL P I S VLN VL KRVFGMFRQ AS D REAVYAC FT F S HWL VYAN S AAN P 1 1 YN F 360 

II II I I I I I I II I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFALC Y L P I S VLN VL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSGLPWSLL 369 

MINIMI 

Db 361 LSGLPWSLL 369 



RESULT 3 

US-09-828-53.8-20 

Sequence 20, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT : Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 

CURRENT APPLICATION NUMBER: US/09/828 , 538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 20 
LENGTH: 389 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-20 

Query Match 96.8%; Score 1875; DB 9; Length 389; 

Best Local Similarity 98.6%; Pred. No. l.le-166; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 
Qy 1 ME ? SAT P GAQMGVP P GS RE P S P VP P D Y E D E FL RY LWRD YL Y P KQ YEWVL I AAY VAVFVVA 60 

I II M I M M M M I I M I I M M I I I I I I M I I II I I M M I M I I M M I M I M I I I 

Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCIAWRNHHMRTV^NYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I II I II I I M I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSIADVXvTAICLPASLLVT)ITESWLF 120 

Qy 121 VI PYLQAVSVSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGSI LGIWAVSLAIMVPQA 180 

I II I I I I I II I I I I I I I I I I M I I I I I I II I I II I I I II II I I I I I I I I I I II I I I II I 

Db 121 VI PYLQAVSVSVAVLTLS FIALDRWYAICHPLLFKST7VRRARGS I LGIWAVSLAIMVPQA 180 



Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

I I I : I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I U I I I I I 

Db 181 AVMECSSVLPEIJ^RTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALC YLPI SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANP 1 1 YNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAWACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 4 

US-09-211-823C-23 

Sequence 23, Application US/09211823C 
Publication No. US20030087801A1 
GENERAL INFORMATION: 



APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
TITLE OF INVENTION 
TITLE OF INVENTION 



HAGEN, JAMES JOSEPH 
TERRETT, JONATHAN ALEXANDER 
UPTON, NEIL 
PIPER/ DAVID 
SMITH, MARTIN IAN 
KENNETT, GUY ANTHONY 
PATEL, SARASWATI R. 

METHODS OF TREATMENT USING NOVEL LIGANDS 
OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 



TITLE OF INVENTION: ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/ 09/2 11 , 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 23 
LENGTH: 38 9 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-2^1-823C-23 



Query Match 96.8%; 
Best Local Similarity 98.6%; 
Matches 358; Conservative 



Score 1875; DB 10; 
Pred. No. l.le-166; 
2; Mismatches 3; 



Length 389; 



Indels 



0; Gaps 



0; 



Qy 

Db 



1 ME P SAT P GAQMGVP P GS RE P S P VP P D YEDE FLRYLWRDYL Y P KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 MEP SAT P GAQMGVP P GS RE P S P VP P D YEDE FLRYLWRDYL Y P KQ YEWVL I AAYVAVFWA 60 



Qy 



61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGa\LCK 120 

I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 



Db 61 L VGN T L VC LAW RN H HMRT VT N Y F I VN L S LAD VL VT AI C L PAS L LVD I T E S W L FGHAL C K 120 

Qy 121 VI P Y LQ AVS V S VAVLT L S F I P L D RW Y AI CHPLLFKS T ARRARG S I L G I WAVS LAI MVP Q A 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI P YLQAVSVS VAVLT LSFIALDRWYAI CHPLLFKS TAR RARGS I LGI WAVS LAI MVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLM7\MAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7VRRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWL VYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWL LVFAL C YL P I S VLNVL KRVFGMFRQAS DREAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 5 

US-09-828-538-19 

Sequence 19, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT: Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 

CURRENT APPLICATION NUMBER: US/ 09/ 828 , 538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version '3 . 0 
SEQ ID NO 19 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-19 



Query Match 96.8%; 
Best Local Similarity 98.6%; 
Matches 358; Conservative 



Score 1875; DB 9; Length 425; 
Pred. No. 1.3e-166; 
2; Mismatches 3; Indels 0; 



Gaps 



0; 



Qy 



1 ME P SAT P GAQMGVP P G S RE P S P VP P D YEDE FLRYLWRD YL YP KQ YEWVLI AAYVAVFWA 60 



Db 



1 



I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 



Qy 61 LVGNTLVCIAWRNHHMRTWNYFIWLSIADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCIAWRNHHMRTWNYFIWLSIJVDVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P Y L Q AVS VS VAVL TLSFIPLD RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAI MVP Q A 180 

I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMQSSSVLPEIANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQI PGTT S ALVRNWKRP S DQLGDLEQGLS GEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AAN P 1 I YN F 360 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYANS AAN P 1 1 YN F 360 

Qy 361 LSG 363 

HI 

Db 361 LSG 363 



RESULT 6 

US-09-828-538-24 

Sequence 24, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT: Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 

CURRENT APPLICATION NUMBER: US/09/828,538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 66/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID, NO 24 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-24 



Query Match 96.8%; Score 1875; DB 9; Length 425; 

Best Local Similarity 98.6%; Pred. No. 1.3e-166; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ME P SAT PGAQMGVP PGS RE P S PVP PD YEDEFLRYLWRDYLYPKQYEWVLI AAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCIAWRNHHMRTVTNYFIW 120 

Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CH PL L FK S TARRARG S I L GI WAVS LAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 121 VI P YLQAVS VS VAVLT L S F I AL D RW YAI CH P L L FK S TARRARG S I LGI WAVS LAIMVPQA 180 

Qy 181 AVMQSSSVTiPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYI^PLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQ I PGTT SALVRNWKRP S DQLGDLEQGL S GEPQ P RGRAFLAEVKQMRARRKT AKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALTONWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 7 

US-09-211-823C-22 

Sequence 22, Application US/09211823C 
Publication No. US20030087801A1 
GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 
APPLICANT: TERRETT, JONATHAN ALEXANDER 
APPLICANT: UPTON, NEIL 
APPLICANT: PIPER, DAVID 

APPLICANT: SMITH, MARTIN IAN ' 
APPLICANT: KENNETT, GUY ANTHONY 
APPLICANT: PATEL, SARASWATI R. 

TITLE OF, IInjVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS v: 
TITLE OF INVENTION: . OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR 
TITLE OF INVENTION: ANTAGONISTS THEREOF 
FILE REFERENCE: P50745 

CURRENT APPLICATION NUMBER: US/09/211 , 823C 
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 



; SEQ ID NO 22 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-211-823C-22 

Query Match 96.8%; Score 1875; DB 10; Length 425; 

Best Local Similarity 98.6%; Pred. No. 1.3e-166; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

1 ME P SAT P GAQMGVP P GS RE P S PVP P D YEDE FLRYLWRD YL YP KQ YEWVLI AAYVAVFVVA 60 

61 LVGNT LVC LAVW RNHHMRT VTN Y F I VN L S LADVLVTAI C L PAS L LVD I T E S WL FGHAL C K 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLW 120 

121 VI P YLQAVSVS VAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGI WAVS LAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 VI P Y LQ AVS VS VAVLT L S F I AL D RW YAI CHPLLFKS T ARRARG S I LG I WAVS LAI MVP QA 180 

181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 AVTVIECSSVLPEIJ^NRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 KLWGRQIPGTTSALVT^WKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

301 MWLLVFALC YL P I S VLN VL KRVFGMFRQAS D REAVYAC FT FS HW LVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 MWLLVFALC YLP I S VLNVL KRVFGMFRQAS DREAVYAC FT FSHW LVYAN SAAN P 1 1 YN F 360 

361 LSG 363 
I I I 

361 LSG 363 



RESULT 8 

US-10-225-567A-368 

; Sequence 368, Application US/10225567A 
; Publication No. US20030113798A1 
; GENERAL INFORMATION: 

APPLICANT: Lifespan Biosciences 
; - APPLICANT: Brown, Joseph P. 
; APPLICANT: Burmer, Glenna C. 

APPLICANT: Roush, Christine L. 
; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED 
RECEPTORS (GPCRS) 
; FILE REFERENCE: 1920-4-4 

; CURRENT APPLICATION NUMBER: US/ 10/225 , 567A 

; CURRENT FILING DATE: 2001-12-19 

; PRIOR APPLICATION NUMBER: 60/257,144 

; PRIOR FILING DATE: 2000-12-19 

; NUMBER OF SEQ ID NOS : 2292 

; SOFTWARE: Patentln version 3.1 



Qy 

Db 

Qy 

Db 

Qy v • 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



SEQ ID NO 368 
LENGTH: 425 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-225-567A-368 

Query Match 96.8%; Score 1875; DB 14; Length 425; 

Best Local Similarity 98.6%; Pred. No. 1.3e-166; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEP S AT PGAQMGVP PGS REP S PVP PDYEDEFLRYLWRDYL YPKQYEWVIjI AAYVAVFVVA 60 

Qy 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLWAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 61 LVGNT LVCLAVWRNHHMRT VTN Y F I VN LS LADVLVT Al C L PAS L LVD I T E S WL FGHAL C K 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVS LAIMVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWL LVFALC YL P I S VLNVL K RVFGM FRQAS D REAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALCYLPISVXNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 9 

US-10-352-684A-22 

Sequence 22, Application US/10352684A 
Publication No. US20030215452A1 
GENERAL INFORMATION: 
APPLICANT: Millennium Pharmaceuticals Inc. 
APPLICANT: Carroll, Joseph M. . 
APPLICANT: . Healy, Aileen 
APPLICANT: Weich, Nadine S. 
APPLICANT: Kelly, Louise M. 

TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TREATING 

TITLE OF INVENTION: HEMATOLOGICAL DISORDERS USING 131, 148, 199, 12303, 
13906, 

; TITLE OF INVENTION: 15513, 17822, 302, 5677, 194, 14393, 28059, 7366, 12212, 
; TITLE OF INVENTION: 1981, 261, 12416, 270, 1410, 137, 1871, 13051, 1847, 
1849, 

; TITLE OF INVENTION: 15402, 340, 10217, 837, 1761, 8990 OR 13249 MOLECULES 
; FILE REFERENCE: MPI02-019P1RNOMNIM 



; CURRENT APPLICATION NUMBER: US/10/352, 684A 

; CURRENT FILING DATE: 2003-01-28 

; PRIOR APPLICATION NUMBER: US 60/354,333 

; PRIOR FILING DATE: 2002-02-04 

; PRIOR APPLICATION NUMBER: US 60/360,258 

; PRIOR FILING DATE: 2002-02-28 

; PRIOR APPLICATION NUMBER: US 60/364,476 

; PRIOR FILING DATE: 2002-03-15 

; PRIOR APPLICATION NUMBER: US 60/375,626 

; PRIOR FILING DATE: 2002-04-26 

; PRIOR APPLICATION NUMBER: US 60/386,494 

; PRIOR FILING DATE: 2002-06-06 

; PRIOR APPLICATION NUMBER: US 60/390,965 

; PRIOR FILING DATE: 2002-06-24 

PRIOR APPLICATION NUMBER: US 60/392,480 
; PRIOR FILING DATE: 2002-06-28 
; PRIOR APPLICATION NUMBER: US 60/394,128 

PRIOR FILING DATE: 2002-07-03 
; PRIOR APPLICATION NUMBER: US 60/399,783 
; PRIOR FILING DATE: 2002-07-31 
; PRIOR APPLICATION NUMBER: US 60/403,221 

PRIOR FILING DATE: 2002-08-13 

Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 62 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 22 
LENGTH: 42 5 
TYPE: PRT 
; ORGANISM: Homo Sapiens 
US-10-352-684A-22 

Query Match 96.8%; Score 1875; DB 14; Length 425; 

Best Local Similarity 98.6%; Pred. No. 1.3e-166; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 MEP SAT PGAQMGVPPGS REPS PVPPDYEDEFLRYLWRDYLYPKQYEWVXIAAYVAVFVVA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1 ME P S AT PGAQMGVP P GS RE P S PVP P D YE DE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWA 60 

Qy 61 LVGNT LVC LAVWRNHHMRTVTN Y FI VN L S LADVL VTAI CL P AS LLVD I T E SWL FGHALC K 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 

Db 61 LVGNT LVC LAVWRNH HMRT VTN Y F I VN L S LADVL VTAI C L PAS L LVD I T E S W L FGHALC K 120 

Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARGS I L G I WAVS LAI MVP QA 180 

I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I 
Db 121 VI P YLQAVS VS VAVLT L S F I AL D RW YAI CHPLLFKS T ARRARG S I LG I WAVS LAI MVP QA 1G0" 

Qy 181 AVT4QSSSVLPELANRTRLFSLCHERWAJDDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: 111111111111111:1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVKECSSVLPELANRTRLFSVCDERWA£)DLYPKIYHSCFFIvT?YLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFAL C YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



301 MWL LVFAL C YL PIS VLNVLKRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 



Qy 



361 LSG 363 



Db 



361 LSG 363 



RESULT 10 
US-09-393-696-2 

; Sequence 2, Application US/09393696 
; Publication No. US20030022277A1 
; GENERAL INFORMATION: 

; APPLICANT: Human Genome Sciences , Inc. et al . 

TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/09/393, 696' 

; CURRENT FILING DATE: 1999-09-10 

; EARLIER APPLICATION NUMBER: PCT/US95/05616 

; EARLIER FILING DATE: 1995-05-05 

; EARLIER APPLICATION NUMBER: US08/4 62,509 

; EARLIER FILING DATE: 1995-06-05 

; NUMBER OF SEQ ID NOS : 23 

; SOFTWARE: Patentln Ver . 2.0 

; SEQ ID NO 2 

LENGTH: 402 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-393-696-2 

Query Match 96.7%; Score 1874; DB 10; Length 402; 

Best Local Similarity 98.3%; Pred. No. 1.5e-166; 

Matches 357; Conservative 2; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 ME P SAT PGAQMGVP P GS RE P S P VP P D YEDE FLRYLWRD YL YP KQ YEWVL I AAYVAVFWA 60 

I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIPAYVAVFWA 60 

Qy 61 LVGN T LVC LAW RNHHMRT VTN Y F I VNL S LADVLVT AI C L P AS LLVD I T E S WL FGHAL C K 12 0 




Db 



61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 





Db 



121 VIPYLQAVSVSVAVLTLSFI7VLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 



Qy 



,. ^81 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAFfe 240 




Db 



181 AVMECSSVLPELANRTRLFSVCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 300 




Db 



241 NLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 



Qy 301 
Db 301 



MWLLVFALCYLPI SVLNVLKRVFGMFRQASD REAVYAC FT FS HWLVYAN SAAN PI I YNF 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MWL LVFAL CY L P I S VLNVL KRVFGMFRQ AS D REAVYAC FT F S HWLVYAN SAAN P 1 1 YN F 360 



Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 11 
US-09-393-696-6 

; Sequence 6, Application US/09393696 
; Publication No. US20030022277A1 
; GENERAL INFORMATION: 

; APPLICANT: Human Genome Sciences, Inc. et al . 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/ 09/393 , 696 
; CURRENT FILING DATE: 1999-09-10 
; EARLIER APPLICATION NUMBER: PCT/US95/ 05616 
; EARLIER FILING DATE: 1995-05-05 

EARLIER APPLICATION NUMBER: US08/4 62 , 509 
; EARLIER FILING DATE: 1995-06-05 
; NUMBER OF SEQ ID NOS : 23 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 6 

LENGTH: 372 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-393-696-6 

Query Match 96.6%; Score 1872; DB 10; Length 372; 

Best Local Similarity 98.3%; Pred. No. 2.1e-166; 

Matches 357; Conservative 3; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ME P S AT PGAQMGVP PGS RD P S PVP P D YEDE FLRYLWRD YL YP KQ YEWVL I AAYVAVFWA 60 

Qy 61 LVGNTLVC^WRNHHMRTWNYFIWLSLAD 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVXVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P YLQAVS VSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVSLAIMVPQA 180 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 121 VI P Y LQ AVS VS VAVLT L S F I ALD RW YAI CH P LL FKS T ARRARG S I L G I WAVS LAI MVP QA 180 

Qy 181 AVMQSSSVXPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQi 240 

III: II II I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVXPELANRTRLFSVCDERWADDLYPKIari 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMR7^RRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I I I 



Db 



361 LSG 363 



RESULT 12 
US-10-077-874-6 

; Sequence 6, Application US/10077874 
; Publication No. US20020115155A1 
; GENERAL INFORMATION: 

; APPLICANT : Soppet, Daniel et al 

; TITLE OF INVENTION: Human Neuropeptide Receptor 

; NUMBER OF SEQUENCES: 12 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
; CITY: Rockville 

; STATE: MD 

COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/077,874 
FILING DATE: 20~Feb-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
FILING DATE: 05-JUNE-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
; REFERENCE/DOCKET NUMBER: PF168P1D1 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 372 amino acids 

TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-10-077-874-6 

Query Match 96.6%; Score 1872; DB 13; Length 372; 

Best Local Similarity 98.3%; Pred. No. 2.1e-166; 

Matches 357; Conservative 3; Mismatches 3; Indels 0; Gaps 0 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I II I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSIADVLWAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVXVTAICLPASLLVTIITESWLFGHALCK 120 



Qy 121 VI P YLQAVS VS VAVLT L S F I P LD RW YAI CH P LL FKS T ARRARG S I LG I WAVS LAI MVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIP Y LQAVSVS VAVLT L S F I AL D RW Y AI CHPLLFKS T ARRARG S I LG I WAVS LAI MVP QA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFR 240 

III: 111111111111111:1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKI YHSCFFI VTYLAPLGLMAMAYFQIFR 240 

Qy 2 41 KLWGRQI PGTT SALVRNWKRP S DQLGDLEQGLS GEPQ PRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQI PGTT S7VLVRNWKRPS DQLGDLEQGLS GEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVL1WLKRVFGMFRQASDREAWACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YL P I S VLN VLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 13 
US-09-826-509-549 

; Sequence 549, Application US/09826509 
; Publication No. US20030204073A1 
; GENERAL INFORMATION: 

; APPLICANT : Lehmann-Bruinsma, Karin 
; APPLICANT: Liaw, Chen W. 
; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: No. US20030204073A1-Endogenous , Constitutively Activated 
Known G 

TITLE OF INVENTION: Protein-Coupled Receptors 
; FILE REFERENCE: AREN-207 

; CURRENT APPLICATION NUMBER: US/09/826, 509 
; CURRENT FILING DATE: 2001-04-05 
; PRIOR APPLICATION NUMBER: 60/195,747 
; PRIOR FILING DATE: 2000-04-07 
; PRIOR APPLICATION NUMBER: 09/170,496 
; PRIOR FILING DATE: 1998-10-13 
; NUMBER OF SEQ ID NOS : 589 
; SOFTWARE: Patentln Version 2.1 
; SEQ ID NO 549 
LENGTH: 425 
; TYPE: PRT 

ORGANISM: Homo sapiens < . ? 

US-09-826-509-549 

Query Match 96.5%; Score 1870; DB 10; Length 425; 

Best Local Similarity 98.3%; Pred. No. 3.7e-166; 

Matches 357; Conservative 2; Mismatches 4; Indels 0; Gaps 0; 
Qy 1 ME P SAT P GAQMGV P PGS RE P S PVP PD YEDE FLRYLWRD YL YPKQ YEWVLI AAYVAVFWA 60 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 ME P S AT PGAQMGVP PG S RE P S PVP P D YEDE FLRYLWRD YLYP KQYEWVL I AAYVAVFVVA 60 



Qy 



61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 LVGNTLVCI^VWRNHHMRTWNYFI 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 121 VI P YLQAVSVS VAVLTL S FI ALDRWYAI CHPLLFKSTARRARGS I LGI WAVS LAIMVPQA 180 



Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: II I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYIAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 3 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTKKML 300 



Qy 301 MWLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAAN PI I YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWL LVFALC Y L P I S VLNVL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 14 
US-10-077-874-2 

; Sequence 2, Application US/10077874 
; Publication No. US20020115155A1 

GENERAL INFORMATION: . . 

APPLICANT: Soppet, Daniel et al 
; TITLE OF INVENTION: Human Neuropeptide Receptor 

; NUMBER OF SEQUENCES: 12 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
CITY: Rockville 
; STATE: MD 

COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln . Release Jl . 0, Version #1.30 
; CURRENT APPLICATION DATA: 

"V* APPLICATION NUMBER: US/10/077,874 

FILING DATE: 20-Feb-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
FILING DATE: 05-JUNE-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 



TELEFAX: 301-309-8439 
- INFORMATION FOR SEQ ID NO: -2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 402 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-077-874-2 

Query Match 96.5%; Score 1869; DB 13; Length 402; 

Best Local Similarity 98.3%; Pred. No. 4.3e-166; 

Matches 357; Conservative 2; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P GS RE P S P VP P D YEDE FL RYLWRD YL YP KQ YEVA/TjI AAWAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSIADVLWAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI PYLQAVSVSVAVLTLS FI PLDRWYAI CHPLLFKSTARRARGS I LGI WAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 121 VI PYLQAVSVSVAVLTLS FIALDRWYAI CHPLLFKSTARRARGSI LGIWAVS LAIMVPQA 180 

Qy 181 AVMQSSSVliPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Ml: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANP 1 1 YNF 360 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | |*| | | | | |.| | | | | | 

Db 301 MWLLVFALCYLPI SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANP 1 1 YNF 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 15 
US-09-961-848-2 

Sequence 2, Application US/09961848 
Patent No. US20020146719A1 
GENERAL INFORMATION: 
APPLICANT: Berglind Ran Olafsdottir 
APPLICANT: Jeffrey Gulcher 

TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 
FILE REFERENCE: 2345.1005-004 
CURRENT APPLICATION NUMBER: US/ 09/961 , 84 8 
CURRENT FILING DATE: 2001-09-24 
PRIOR APPLICATION NUMBER: US 09/479,128 
PRIOR FILING DATE: 2000-01-07 
PRIOR APPLICATION NUMBER: US 09/379,083 



PRIOR FILING DATE: 1999-08-23 
NUMBER OF SEQ ID NOS : 22 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2 
LENGTH: 425 
TYPE: PRT 

ORGANISM: Homo Sapiens 
US-09-961-848-2 

Query Match 96.5%; Score 1869; DB 9; Length 425; 

Best Local Similarity 98.3%; Pred. No. 4.6e-166; 

Matches 357; Conservative 2; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP P D Y ED E FL R YLWRD Y L Y P KQ Y EWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I l-l I 
Db 1 ME P SAT PGAQMGVPPGS RE PSPVPPDYEDEFLRYLWRDYLYPKQYEWVL I AAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTOTNYFIWLSLADVTWTAIC^ 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSIADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CHPLLFKS T ARRARG S I L G I WAVS LAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VI P YLQAVS VS VAVLT LS F I AL D RW YAI CH P L L FKS T ARRARG S I L GI WAVS LAI MVPQA 180 

Qy 181 AWQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 24 0 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 181 AVTVIECSSVIjPEIANRTRLFSVCDERWADDLYPKIYHSCFFIWYIA.PLGLMAMAYFQ 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVl^WKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN SAANPI I YN F 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



Search completed: October 14, 2004, 11:16:35 
Job time : 69.6732 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



October 14, 2004, 10:41:50 ; Search time 96.8388 Seconds 

(without alignments) 
2192.441 Million cell updates/sec 

US-10-070-532-4 
1937 

1 ME P SAT P GAQMGVP P G S RE P NSAANPIIYNFLSGLPWSLL 369 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1825181 seqs, 575374646 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1825181 



Database 



UniProt_02:* 
1: uniprot_sprot : * 
2: uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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ALIGNMENTS 



RESULT 1 
0X1R_HUMAN 

ID 0X1R_HUMAN STANDARD; PRT; 425 AA. 

AC 043613; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 1 (Oxlr) (Hypocretin receptor type 1) . 

GN Name-HCRTRl ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Crahiata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] ^ 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98150861; PubMed=9491897 ; 

RA Sakurai T., Amemiya A., Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H. f Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S. f 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J. A., Elshourbagy N.A., Bergsma D.J. f 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 



RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585 (.1998) . 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed-11340621 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E. ; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed-11283317 ; 

RA Willie J.T., Chemelli R.M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A 
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be 

CC exclusively coupled to the G(q) subclass of heteromeric G 

CC proteins, which activates the phospholipase C mediated signaling 

CC cascade (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright- It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC — 

DR EMBL; AF041243; AAC39601.1; 

DR Genew; HGNC4 848; HCRTR1. 

DR MIM; 602392; -. 

DR GO; ■ GO: 0005887; Cintegral to plasma membrane; TAS . 

DR GO; GO: 0004930; F: G-protein coupled receptor activity; TAS. 

DR GO; GO: 0007631; P: feeding behavior; TAS. 

DR GO; GO: 0007218; P : neuropeptide signaling pathway; TAS. 

DR. GO; GO:0007268; P:synaptic transmission; TAS. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004059; Orexin_receptorl . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PRINTS; PR01521; OREXIN1R. 



DR 


PROSIT^; 


PS00237; 


G_PROTEIN_ 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


g_protein" 


_RECEP_F1_2 ; 1 . 


KW 


G-protein 


coupled 


receptor; 


Transmembrane . 


FT 


DOMAIN 


1 


46 


Extracellular (Potential) . 


FT 


TRANSMEM 


47 


67 


1 (Potential) . 


FT 


DOMAIN . 


68 


80 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


81 


102 


2 (Potential) . 


FT 


DOMAIN 


103 


119 


Extracellular (Potential) . 


FT 


TRANSMEM 


120 


142 


3 (Potential) . 


FT 


DOMAIN 


143 


164 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


165 


185 


4 (Potential) . 


FT 


DOMAIN 


186 


216 


Extracellular (Potential) . 



FT 




217 


239 

-J z> 


S ( Pnfpnt" -i si 1 


FT 

r i 




24 0- 




P*\ /-)- 1 ;p qtti "I r+ f Pnt"pnt" i a 1 \ 
^•_y LU|J± aouLX u V irvJL-triii — LclJ_ ^ • 




TRANSMFM 


299 


321 




FT 


DOMAIN 


322 


336 


Extracellular (Potential) . 


FT 


TRANSMEM 


337 


360 


7 (Potential) . 


FT 


DOMAIN 


361 


425 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


194 


194 


N-linked (GlcNAc. . . ) (Potential) 


SQ 


SEQUENCE 


425 AA; 


47521 


MW; 1634083DE10CA092 CRC64; 



Query Match 96.8%; Score 1875; DB 1; Length 425; 

Best Local Similarity 98.6%; Pred. No. 1.5e-120; 

Matches 358; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 ME P SAT P GAQMGVP P G S RE P S P VP PDYEDEFLRY LW RD YL Y P KQ Y EWVL I AAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNT LVC LAVW RNHHMRT VTN Y FI VN L S LADVLVT AI C L PAS L LVD I T E S WL FGHALC K 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 LVGNTLVCiyVWRNHHMRTWNYFIWLSI^VDVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VI P Y LQAVS VS VAVLT L S F I AL D RW YAI CHPLLFKS TARRARG S I L G I WAVS LAIMVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYl^ 24 0 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AVMECSSVLPEiy\NRTRLFSVCDERWADDLYPKIYHSCFFIWYIAPLGLMAMAYFQIFR 240 



Qy 241 KLWGRQIPGTTSALVR1WKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVl^WKRPSDQLGDLEQGLSGEPQPRGRAFIxAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAWACFTFSHWLVY7\NSAANPIIYNF 360 

Qy 361 LSG 363 

I I I 

Db' 361 LSG 363 



RESULT 2 
Q9HBV6 

ID Q9HBV6 PRELIMINARY; PRT; 425 AA. 

AC Q9HBV6V . k 'v : 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2004 (TrEMBLrel. 28, Last annotation update) 

DE Hypocretin receptor-1 (Orexin receptor 1) . 

GN Name=HCRTRl; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM . N . A. 



RX MEDLINE-20429525; PubMed-10973318 ; 

RA Peyron C, Faraco J. , Rogers W., Ripley B., Overeem S., Charnay Y., 

RA Nevsimalova S., Aldrich M., Reynolds D., Albin R., Li R., Hungs M. , 

RA Pedrazzoli M. , Padigaru M. , Kucherlapati M. , Fan J., Maki R. r 

RA Lammers G.J., Bouras C, Kucherlapati R. , Nishino S., Mignot E. ; 

RT "A mutation in a case of early onset narcolepsy and a generalized 

RT absence of hypocretin peptides in human narcoleptic brains."; 

RL Nat. Med. 6:991-997(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21580342; PubMed=l 172 3285 ; 

RA Olafsdottir B.R., Rye D . B . , ■ Scammell T.E., Matheson J.K., 

RA Stefansson K. , Gulcher J.R.; 

RT "Polymorphisms in hypocretin/orexin pathway genes and narcolepsy."; 

RL Neurology 57:1896-1899(2001) . 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Olafsdottir B . R . , Stef ansdottir R.H., Sigurdsson A. , Hannesson H.H., 

RA Sainz J., Scammell T.E., Stefansson K. , Gulcher J.R.; 

RL Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Yeager M. , Welch R., Haque K. , Bergen A.; 

RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pooled tissue; 

RX MEDLINE=22388257; PubMed-12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M. F. , Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P. f Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J W Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J W Malek J.A. f Gunaratne P.H., 

RA Richards S., Worley K.C. f Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G. G. , 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A. , Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pooled tissue; 

RA Strausberg R. ; 

RL Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF202084; AAG28020.1; -. 

DR EMBL; AF202078; AAG28020.1; JOINED. 

DR EMBL; AF202079; AAG28020.1; JOINED. 

DR EMBL; AF202081; AAG28020.1; JOINED. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
SQ 



EMBL; AF202083 
EMBL ; AF202082 
EMBL; AF202080 
EMBL; AY062030 
EMBL; AY070269 
BC074796 
GO:0016021 
GO:0016499 
GO:0004872 
GO: 0001584 
GO:0007186; 



EMBL; 
GO; 
GO; 
GO; 
GO; 
GO; 



AAG28020. 
AAG28020. 
AAG28020. 
AAL47214. 
AAL50221. 
AAH74796. 



JOINED. 
JOINED. 
JOINED. 



C: integral to membrane; IEA. 
F:orexin receptor activity; IEA. 
F: receptor activity; IEA. 
F: rhodopsin-like receptor activity; IEA. 
P:G-protein coupled receptor protein signalin. 
InterPro; IPR000276; GPCR_Rhodpsn . 
InterPro; IPR000204; Orexin_receptor . 
InterPro; IPR004059; Orexin_receptorl . 
Pfam; PF00001; 7tm_l; 1. 
PRINTS; PR002 37; GPCRRHODOPSN . 
PRINTS; PR01521; OREXIN1R. 
PRINTS; PR01064; OREXINR. 

PROSITE; PS00237; G_J>ROTEIN_RECEP_Fl_l ; UNKNOWN_l . 
PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 
Receptor. 

SEQUENCE 425 AA; 47535 MW; B650B37F3A2CA096 CRC64; 



; IEA. 



Query Match 96.5%; 
Best Local Similarity 98.3%; 
Matches 357; Conservative 



Score 1869; DB 2; Length 425; 
Pred. No. 3.9e-120; 
2; Mismatches 4; Indels 0; 



Gaps 



0; 



Qy 

Db 



1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

1 ME P SAT P G AQMGVP PGSREPSPVPPDYEDEFLRY LW RD Y L Y P KQ Y E WVL I AAYVAVFWA 60 



Qy 

Db 



61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI CH P L L FK S TARRARG S I L G I WAVS LAI MVPQA 180 

I I I I I I I I I I II I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

181 AVMQSSSVLPEIiANRTRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 AVMECSSVLPEI^RTRLFSVCDERWADDLYPKIYHSCFFIWYIAPLGLMAMAYFQIFR 240 

241 KLWGRQ I P GTT S ALVRNWKRP S DQLGDLEQGL S GE PQ P RGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 KLWGRQIPGTTSALVRNWKR-L^SDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

'301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
301 MWLLVFAL C YL P I S VLN VL KRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 



361 LSG 363 
I I I 

361 LSG 363 



RESULT 3 



0X1R RAT 



ID OX1RJRAT STANDARD; PRT; 416 AA. 

AC P56718; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 1 (Oxlr) (Hypocretin receptor type 1) . 

GN Name=Hcrtrl; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=98150861; PubMed-9491897 ; 

RA Sakurai T., Amemiya A. , Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H., Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S., 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J. A. , Elshourbagy N.A., Bergsma D.J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors : a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998) . 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed-11340621 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M., Mignot E.; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [3] 

RP REVIEW. 

RX MEDLINE-21178476; PubMed=11283317 ; 

RA Willie J.T., Chemelli R.M. , Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in "the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A 
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be 

CC exclusively coupled ^to the G(q) subclass of heteromeric G 

CC proteins, which activates the phospholipase C mediated signaling 

CC cascade. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Highly expressed in the brain in the 

CC prefrontal cortex, hippocampus, paraventricular thalamus, 

CC ventromedial hypothalamus, arcuate nucleus, dorsal raphe nucleus, 

CC and locus coeruleus. Not detected in the spleen, lung, liver, 

CC skeletal muscle, kidney and testis. Orexin receptor mRNA 

CC expression has also been reported in the adrenal gland, enteric 

CC nervous system, and pancreas. 

CC -!- INDUCTION: By nutritional state, up-regulated by fasting. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 



cc 


modified 


and this 


statement 


is not removed. Usage by 


cc 


entities 


requires 


a license 


agreement (See http://www.i 


cc 
cc 

nD 

Uf\ 


or send an email 


to license@isb-sib . ch) . 


EMBL; AF041244; AAC40041.1; 






RGD; 2787; Hcrtrl 






DR 


InterPro; 


IPR000276; GPCR_Rhodpsn . 




InterPro; 


IPR000204; Orexin_ 


receptor. 


FiR 


InterPro; 


IPR004059; Orexin 


receptorl . 


nD 

U r\ 


Pfam; PF00001; 7tm 1; 1. 




np 


PRINTS; PR00237; 


GPCRRHODOPSN. 


nn 
JJK 


PRINTS; PR01521; 


OREXIN1R. 




nD 

Ur\ 


PROSITE; 


PS00237; 


G_PROTEIN_ 


DCTTD 17*1 1 . 1 


JJK 


* PROSITE; 


PS50262; 


G PROTEIN^ 




VT«7 

JaW 


G-protein coupled receptor; 


Transmembrane . 


£ 1 


DOMAIN 


1 


46 


Extracellular (Potential) . 


r 1 


TRANSMEM 


47 


67 


1 (Potential) . 


r i 


DOMAIN 


68 


80 


Cytoplasmic (Potential) . 


r 1 


TRANSMEM 


81 


102 


2 (Potential) . 


£ 1 


DOMAIN 


103 


119 


Extracellular (Potential) . 


r I 


TRANSMEM 


120 


142 


3 (Potential) . 


r 1 


DOMAIN 


143 


164 


Cytoplasmic (Potential) . 


"FT 

r i 


TRANSMEM 


165 


185 


4 (Potential). 


r 1 


DOMAIN 


186 


216 


Extracellular (Potential) . 


"FT 


TRANSMEM 


217 


239 


5 (Potential) . 


r 1 


DOMAIN 


240 


298 


Cytoplasmic (Potential) . 


r i 


TRANSMEM 


299 


321 


6 ( Potential ) . 


FT 


DOMAIN 


322 


336 


Extracellular (Potential) . 


.FT 


TRANSMEM 


337 


360 


7 (Potential). 


FT 


DOMAIN 


361 


416 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


194 


194 


N-linked (GlcNAc. . .) (Po 


SQ 


SEQUENCE 


416 AA 


; 46799 MW; 774DE7A22EA05D18 CRC64; 



and for commercial 



Query Match 90.6%; Score 1755; DB 1; Length 416; 

Best Local Similarity 92.6%; Pred. No. 2.5e-112; 

Matches 336; Conservative 8; Mismatches 19; Indels 0; Gaps 0; 

Qy 1 ME P SAT PGAQMGVP P GS RE P S PVP P D YEDEFLRYLWRD YL YP KQYEWVX I AAYVAVFVVA 60 

I I I I I I I I I I II I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I :: I 
Db 1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVXVTAICLPASLLVDITESW 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 LVGNT LVC LAVWRNH HMRT VTN Y FI VNL S LADVLVT AI C L PAS LLVD I T E S WL FGHALC K 120 

-Qy" 121 VI P YLQAVS VS VAVLT L S F I P L D RW YAI C H P L L FK S TARkARG S I L GI WAVS LAIMVPQ A 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I : I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 

Qy 181 AVMQSSSVTjPELANRTRLFSLCHERWADDLYPKIYHSCFFIW 240 

III: I I I I I I II I I I I I I I : I I I I I I : I I I I I I I I I I I I I I I I I I I I I . I I I I I I I I 
Db 181 AvKECSSVLPELT^IRTRLFSVCDERWADELYPKIYHSCFFFVTYLAPLGLMGMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALvIMWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I II I I I I I I : I I III I II II I I I I I I I I I I I I I I I I II I I 

Db 241 KLWGPQIPGTTSALVRNWKRPSEQLEAQHQGLCTEPQPRARAFLAEVKQMRARRKTAKML 300 



Qy 301 MWL LVFAL C YL PIS VLNVLKRVFGMFRQAS D REAVYAC FT F S HW LVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWL LVFALC YL P I S VLNVLKRVFGMFRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

III 

Db 361 LSG 363 



RESULT 4 
Q6VNS3 

ID Q6VNS3 PRELIMINARY; PRT; 416 AA. 

AC Q6VNS3; 

DT 05-JUL-2004 (TrEMBLrel. 27, Created) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 21, Last annotation update) 

DE Orexin receptor type-1. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/ c; 

RA Chen J. , Randeva H.S.; 

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY336083; AAR01326.1; -. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004059; Orexin_receptorl . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PRINTS; PR01521; OREXIN1R. 

DR PRINTS; PR01064; OREXINR. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; UNKNOWN_l . 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 416 AA; 46766 MW; A8958C594C365E00 CRC64; 

Query Match 90.5%; Score 1753; DB 2; Length 416; 

Best Local Similarity 92.6%; Pred. No. 3.4e-112; 

Matches 336; Conservative 8; Mismatches 19; Indels 0; Gaps 0; 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I II I I II I I I III I II : I I I I I I I I | I I | I I I I II I I I I I I I I I II I I I I I : : I 
Db 1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVXWAICLPASLLVDITES 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db ' 61 LVGN T LVC LAW RNHHMRTVTN Y F I VN L S LADVLVTAI C L PAS L LVD I T E S W L FGQAL C K 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 



Qy 



181 AViyiQSSSVLPELANRTRLFSLCHERWAI)DLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 



Db 



181 



III: I I I I I I I I I I I I I I I : I I I I I : I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

AVMECSSVLPELA^^RTRLFSVCDEHWADELYPKIYHSCFFIVTYLAPLGLMG^4AYFQIFR 240 



Qy • 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I I I I I I I I I I I I I I I I I I I I I I : I I II I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSEQLEAQHQGLCTEPQPRT^FLAEVKQMRARRKTAKML 300 

Qy 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 MWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 



Qy 

Db 



361 LSG 363 
I I I 

361 LSG 363 



RESULT 5 
AAR01326 



ID AAR01326 PRELIMINARY; PRT; 416 AA. 

AC AAR01326; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel . 27, Last sequence update) 

DT 02^MAR-2004 (TrEMBLrel. 21, Last annotation update) 

DE Orexin receptor type-1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J., Randeva H.S.; 

RT "Cloning and Characterization of the Mouse Type-1 Orexin Receptor 

RT (OX1R)."; 

RL Submitted (JUL-2003) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AY336083; AAR01326.1; -. 

KW Receptor. 

SQ SEQUENCE 416 AA; 46766 MW; A8958C594C365E00 CRC64; 



Query Match 90.5%; Score 1753; DB 2; Length 416; 

Best Local Similarity 92.6%; Pred. No. 3.4e-112; 

Matches 336; Conservative 8; Mismatches 19; Indels 0; Gaps 0; 



Qy 1 ME P S AT P GAQMGVP PGS RE P S P VP P D YEDE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFVVA 60 

I I I I I I I I I I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I :: I 
Db 1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 6Gv" 

Qy 61 L VGNT L VC LAVW RN HHMRT VTN Y F I VN L S LADVLVT AI C L PAS L LVD I T E S WL FGHAL C K 120 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I 

Db 61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVXVTAICLPASLL 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 

Qy 181 AVMQSSSVLPELANRTRLFSLCHERWADDLYPKIYHSCFFIVTYl^PLGLMAMAYFQIFR 240 

III: I I I I I I I I I I I I II I : I I I I I : I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 



Db 



181 AVMECSSVLPELANRTRLFSVCDEHWADELYPKIYHSCFFIVTYLAPLGLMGMAYFQIFR 240 



Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

I II I I I I I I I I I I I I I I I I I I I : I I III I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQ I PGTT S ALVRNWKRP S EQLEAQHQGLCT E PQ P RARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWL LVFAL C YL P I S VLNVL KRVFGMFRQAS D REAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 MWLLVFAL C YL P I S VLNVL KRVFGM FRQAS D REAVYAC FT F S HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSG 363 

I II 

Db 361 LSG 363 



RESULT 6 
OX2R_RAT 

ID ' OX2R_RAT STANDARD; PRT; 460 AA. 

AC P56719; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2) . 

GN Name=Hcrtr2; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE-98.150861; PubMed=94918 97 ; 

RA Sakurai T., Amemiya A., Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H., Williams S.C., Richardson J. A., Kozlowski G.P., Wilson S., 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J. A., Elshourbagy N.A., Bergsma D.J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=1134 062 1 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E.; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RE."?* Bioessays 23:397-408 (2001). ^ 

RN [3] 

RP REVIEW. 

RX MEDLINE=2117 847 6; PubMed-11283317 ; 

RA Willie J.T. f Chemelli R-M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness . "; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Expressed in the brain in the cerebral cortex, 



CC septal nuclei , hippocampus , medial thalamic groups, dorsal and 

CC median raphe nuclei, and many hypothalamic nuclei including the 

CC tuberomammillary nucleus, dorsomedial hypothalamus, 

CC paraventricular hypothalamic nucleus, and ventral premammillary 

CC nucleus. Not detected in the spleen, lung, liver, skeletal muscle, 

CC kidney and testis. Orexin receptor mRNA expression has also been 

CC reported in the adrenal gland, enteric nervous system, and 

CC pancreas . 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



DR EMBL; AF041246; AAC40042.1; -. 

DR RGD; 2788; Hcrtr2 . 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PF00001; 7tm_l; 1. 

DR Pfam; PF03827; Orexin rec2; 1. 



DR 


PRINTS; PR00237; 


GPCRRHODOPSN. 


DR 


PRINTS; PR01522; 


OREXIN2R. 




DR 


PROSITE; 


PS00237; 


G_PROTEIN 


_RECEP_F1_1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN 


_RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Glycoprotein; Transmembrane. 


FT 


DOMAIN 


1 


54 


Extracellular (Potential) . 


FT 


TRANSMEM 


55 


, 75 


1 (Potential) . 


FT 


DOMAIN 


76 


88 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


89 


110 


2 (Potential) . 


FT 


DOMAIN 


111 


127 


Extracellular (Potential) . 


FT 


TRANSMEM 


128 


150 


3 (Potential) . 


FT 


DOMAIN 


151 


172 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


173 


193 


4 (Potential) . 


FT 


DOMAIN 


194 


224 


Extracellular (Potential) . 


FT 


TRANSMEM 


225 


247 


5 (Potential) . 


FT 


DOMAIN 


248 


304 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


305 


327 


6 (Potential) . 


FT 


DOMAIN 


328 


342 


Extracellular (Potential) . 


FT 


TRANSMEM 


343 


366 


7 (Potential) . 


FT 


DOMAIN 


367 


460 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


14 


14 


N- linked (GlcNAc . . .) (Potential) 


FT 


CARBOHYD 


22 


22 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


202 


202 


N-linked (GlcNAc. . .) (Potential) 


SQ 


SEQUENCE 


460 AA; 52489 MW; 3B4 4E3D82F8B85D5 CRC64; 



Query Match 68.5%; Score 1326.5; DB 1; Length 460; 

Best Local Similarity 70.9%; Pred. No! 6.1e-83; 

Matches 253; Conservative 39; Mismatches 44; Indels 21; Gaps 4; 



Qy 

Db 



17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 



-76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIP-YLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I II I I I I : I : I I I I I : I II : I I I I I I I I I I I I I I : I I 

84 HMRT VT N Y F I VN L S LAD VL VT I T C L PAT L WD I T ET W F FGQ S L C KVI P YLQT VS VS VS VL 143 

136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSIJVIMVPQAAVMQSSSVLPELANR 195 

II I I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I I : I I III: 

144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMERSSMLPGLANK 203 

L96 TRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : : I III : : I I I : I I I I I : I II : I I I II : I I I I I I I I I I I I I I I : I : 
204 TTLFTVCDERWGGEVYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMVVLLV 306 

I I I : I I : I I I I I I I I : I I : II I I I I I : I I I I I I I I 

264 RKWKQP QPVS Q P RG S GQQ S KARI S AVAAE I KQ I RARRKT ARMLMWLLV 312 

307 FALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSG 363 

I I : I I I I I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I 

313 FAICYLPISIL^^VXKRVFGMFTHTEDRETWAWFTFSHWLVY7^SAANPIIYNFLSG 369 



RESULT 7 
OX2R_CANFA 

ID OX2R_CANFA STANDARD; PRT; 444 AA. 

AC Q9TUP7; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2) . 

GN Name=HCRTR2 ; 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99385793; PubMed=10458611 ; 

RA Lin L., Faraco J., Li R. , Kadotani H., Rogers W. f Lin X., Qiu X., 

RA de Jong P.J., Nishino S., Mignot E. ; 

RT "The sleep disorder canine narcolepsy is caused by a mutation in the 

RT hypocretin receptor 2 gene."; 

RL Cell 98:365-376(1999). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PiSbMed=l 134 0621; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M., Mignot E.; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317 ; 

RA Willie J.T., Chemelli R.M., Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

RN [4] 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



RP VARIANT NARCOLEPSY LYS-54, AND MUTAGENESIS OF GLU-54. 

RX MEDLINE=21180003; PubMed=11282968 ; DOI=10 . 1101/gr. 161001; 

RA Hungs M. , Fan J., Lin L. f Lin X., Maki R.A. , Mignot E.; 

RT "Identification and functional analysis of mutations in the hypocretin 

RT (orexin) genes of narcoleptic canines."; 

RL Genome Res. 11:531-539(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DISEASE: Defects in HCRTR2 are a cause of an autosomal recessive 

CC form of narcolepsy, observed in labradors, dobermans and 

CC dachshunds. Narcolepsy is a neurological sleep disorder affecting 

CC animals and humans, characterized by excessive daytime sleepiness, 

CC sleep fragmentation, symptoms of abnormal rapid-eye-mouvement 

CC (REM) sleep, such as cataplexy, hypnagogic hallucinations, and 

CC sleep paralysis. Cataplexy is a sudden loss of muscle tone 

CC triggered by emotions, which is the most valuable clinical feature 

CC used to diagnose narcolepsy. As in humans, most cases of canine 

CC narcolepsy are sporadic but an autosomal recessive form was also 

CC observed. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC ■ 

DR EMBL; AF164626; AAD49333.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PF00001; 7tm_I; 1. 

DR Pfam; PF03827; Orexin_rec2 ; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PRINTS; PR01522; 0REXIN2R. 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN__RECEP_Fl_2 ; 1. 

KW Disease mutation; G-protein coupled receptor; Glycoprotein; . 



KW 


Transmembrane . 






FT 


DOMAIN 


1 


54 


Extracellular (Potential) . 


FT 


TRANSMEM 


55 


75 


1 (Potential) . 


FT 


DOMAIN 


76 


88 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


89 


110 


2 (Potential). 


FT 


DOMAIN 


111 


127 


Extracellular (Potential) . 


FT 


TRANSMEM 


128 


150 


3 (Potential) . 


FT 


DOMAIN 


151 


172 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


173 


193 


4 (Potential). 


FT 


DOMAIN 


194 


224 


Extracellular (Potential) . 


FT 


TRANSMEM 


225 


247 


5 (Potential) . 


FT 


DOMAIN 


248 


304 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


305 


327 


6 (Potential) . 


FT 


DOMAIN 


328 


342 


Extracellular (Potential) . 


FT 


TRANSMEM 


343 


366 


7 (Potential) . 


FT 


DOMAIN 


367 


444 


Cytoplasmic (Potential) . 



FT CARBOHYD 14 14 N-linked (GlcNAc. . .) (Potential) 

FT CARBOHYD . 22 22 N-linked (GlcNAc. . .) (Potential) 

FT CARBOHYD 202 202 N-linked (GlcNAc. . .) (Potential) 

FT VARIANT 54 54 E -> K (in autosomal recessive 

FT narcolepsy) . 

FT MUTAGEN 54 54 E->K: Loss of function. 

SQ SEQUENCE 444 AA; 50675 MW; D848A4 536D4 8 5D6B CRC64; 



Query Match 68.1%; Score 1319.5; DB 1; Length 444; 

Best Local Similarity 71.4%; Pred. No. 1.8e-82; 

Matches 250; Conservative 42; Mismatches 51; Indels 7; 



Gaps 



3; 



Qy 

Db 

Qy 

Db 



17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVXIAAWAVFWT^VGNTLVCLAVWRNH 

: : I I I I I : I I I I I II I I : I I : I I : I I I I I I I I : I I I I I I I I I I I I : I I I : I I 

24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALVGNVLVCVAVWKNH 



75 



83 



135 



76 HMRTWNYFIWLSLADVXVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 
I I I I I I I I I I I II I I I I I I I I I I I I : I : I I I I I : I II : I I I I I I I I I I I I I I : I I 

84 HMRTVTNYFIWLSLADVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



136 TLS FI PLDRWYAI CHPLLFKSTARRARGS I LGIWAVSLAIMVPQAAVMQS S SVLPELANR 
II I I I I II I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I :: N III: 
144 TLSCIALDRWYAI CHPLMFKSTAKRARNS IVI IWI VSCI IMI PQAI VMECSTMLPGLANK 



196 



204 



256 



TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 
I I I : : I I II : : I I I : I I I I I : I I I : I I I II : I I I I II I I I I I I I I I : I : 
TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 



195 



203 



255 



263 



313 



RNWK— RPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 

III : I : I : I : : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

264 RKWKPLQPASQ P RGPGQQT KS RI S AVAAE I KQ I RARRKTARMLMWLLVFAI C YL P 319 

314 I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AANP 1 1 YN FLS G 363 

II : I I I I I I I I I I I III I II I I I I I I I I I I I I I I I I I I I I I I I I 

320 I S I LNVLKRVFGMFTHT EDRET VYAW FT FS HWLVYAN S AAN P 1 1 YN FL S G 369 



RESULT 8 
Q6VLX3 

ID Q6VLX3 PRELIMINARY; PRT; 443 AA. 

AC Q6VLX3; 

DT 05-JUL-2004 (TrEMBLrel. 27 , Created) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2a. 

• V GN Name=MOXR2; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J., Randeva H.S.; 

RL Submitted (JUL-2003) to the EMBL/ GenBank/ DDB J databases. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein (By similarity). 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 



DR EMBL; AY339389; AAR11294.1; 

DR EMBL; AY339383; AAR11294.1; JOINED . 

DR EMBL; AY339384 ; AAR11294 . 1 ; JOINED. 

DR EMBL; AY339385; AAR11294.1; JOINED. 

DR EMBL; AY339386; AAR11294.1; JOINED. 

DR EMBL; AY339387; AAR11294.1; JOINED. 

DR EMBL; AY339388; AAR11294.1; JOINED. 

DR EMBL; AY336084; AAR01327.1; -. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR InterPro; IPR000923; BlueCu_l . 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PF00001; 7tm_l; 1. 

DR Pfam; PF03827; Orexin_rec2; 1. 

DR PRINTS; PRO 02 37; GPCRRHODOPSN . 

DR PRINTS; PR01522; OREXIN2R. 

DR PRINTS; PR01064; OREXINR. 

DR PROSITE; PS00196; COPPER_BLUE; UNKNOWN_l. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 443 AA; 50559 MW; 794736A669463283 CRC64; 



Query Match 67.9%; Score 1315.5; DB 2; Length 443; 

Best Local Similarity 70.3%; Pred. No. 3.3e-82; 

Matches 251; Conservative 40; Mismatches 45; Indels 21; 



Gaps 



Qy 

Db 



17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



7 6 HMRTWNYFIWLSLADVIjvTAICLPASLLVTDITESWLFGHALCKVIPYLQAVSVSVAv^ 135 

I I I I I I I I I I I I II I I I I I I I I I I I : I : I I I I I : I I I : I I I I I I I I I I I I I I : I I 

84 HMRTVTNYFIWLSLADVXWITCLPATLVAnDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 

II I I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I II I I : I I : I I III: 

144 TLSCIALDRWYAICHPLMFKSTAKFU^U^SIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 



196 



204 



256 



264 



307 



TRLFSLCHERWADDLYPKIYHSCFFIWY^PLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 
I I I : : I M : I I I I I : I I I : I I I II :|| Ilillll lllllhl : 

TTLFTVCDEHWGGEVYPKMYHI CFFLVT YMAPLCLMI LAYLQI FRKLWCRQI PGTS S WQ 263 

RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMWLLV 306 

III: : I : I i rl I I I I : I I : I I I I I I I : I I I I I I I I 

RKWKQ QQPVS QPRGSGQQSKARISAVAAEIKQIRARRKTARMLMWLLV 312 



FALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYANSAANP 1 1 YN FLS G 363 
I I : I I I I I I : I I I I I I I I II I III III I I I I I I I I I I I I I I I I I I I I I I I I 

313 FAICYLPISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSG 369 



RESULT- 9 
AAR01327 

ID AAR01327 PRELIMINARY; PRT; 443 AA. 

AC AAR01327; 



DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel. 21, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 21, Last annotation update) 

DE Orexin receptor type-2a. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-BALB/c; 

RA Chen J. , Randeva H.S.; 

RT "Cloning and Characterization of the Mouse Type-2a Orexin Receptor 

RT Subtype (0X2aR)."; 

RL Submitted (JUL-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY336084; AAR01327.1; 

KW Receptor. 

SQ SEQUENCE 443 AA; 50559 MW; 794736A669463283 CRC64; 



Query Match 67.9%; Score 1315.5; DB 2; Length 443; 

Best Local Similarity 70.3%; Pred. No. 3.3e-82; 

Matches 251; Conservative 40; Mismatches 45; Indels 21; Gaps 4; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWLIAAWAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I' I I I I I I II : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWPCNH 83 

Qy 76 HMRTVTNYFIWLSIJU)VLVTAICLPASLLVT)ITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I II I I : I : I I I I I : I II : I I I I ! I I I I I I I I I : I I 
Db 84 HMRTWNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVl^QSSSVXPEIJ^R 195 

III I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I I : I I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMI PQAIVMECSSMLPGLANK 203 

Qy 196 TRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I M::| I I ::|||:|| llhllhlll II :|| lllllll lllll|:| : 
Db 204 TTLFTVCDEHWGGEVTPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMWLLV 306 

Ml: : I : I I I I I I I I : I I : I I I I I II : I I I I I I I I 

Db 264 RKWKQ QQPVS Q P RG S GQQ S KARI S AVAAE I KQ I RARRKT ARMLMWLLV 312 

Qy 307 FALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSG 363 

I I : I I I I I I : I I I I I I III III I I I I I I I II I I I I I I I I I I I I I I I 

Db 313 FAI C YL ? I S I LNVLKRVFGMFTHT EDRET VYAW FT FS HWLVYANS AAN P 1 1 YN FLS G 369 



RESULT 10 
AAR11294 

ID AAR11294 PRELIMINARY; PRT; 443 AA. 

AC AAR11294; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2a. 

GN MOXR2. 



OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J., Randeva H.S.; 

RT "Genomic structure analysis of the Mus musculus orexin type-2 (M0XR2) 

RT gene . " ; 

RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY339383 ; AAR11294 . 1 ; JOINED. 

DR EMBL; AY339384; AAR11294.1; JOINED. 

DR EMBL; AY339385; AAR11294.1; JOINED. 

DR EMBL; AY339386; AAR11294.1; JOINED. 

DR EMBL; AY339387; AAR11294.1; JOINED. 

DR EMBL; AY339388; AAR11294.1; JOINED. 

DR EMBL; AY339389; AAR11294.1; -. 

KW Receptor. 

SQ SEQUENCE 443 AA; 50559 MW; 794736A669463283 CRC64 ; 



Query Match 67.9%; Score 1315.5; DB 2; Length 443; 

Best Local Similarity 70.3%; Pred. No. 3.3e-82; 

Matches 251; Conservative. 40; Mismatches 45; Indels 21; Gaps 4; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWLIA^WAVEWALVGNTLVCLAVWRNH 7 5 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVXIAGYIIVFWALIGNVXVCVAVWKNH 83 

Qy 76 HMRTWNYFIWLSLAI)VXWAICLPASLLVDITESWLFGHALCKV 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I I I I I I I I I I I I I I : II 
Db 84 HMRTWNYFIWLSIADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVu 143 

QY 136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 

Ml I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I I : I I | | | : 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

QY 196 TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I M::| | I ::|||:M ll|:|||:||| II :|| lllllll Mlllhl : 
Db 204 TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMWLLV 306 

IM= : I : I I I I I I I I : I I : I I I I I I I : I I I I I I ! I 

Db 264 RKWKQ QQPVS QPRGSGQQSKARISAVAAEIKQIRARRKTARMLMWLLV 312 

Qy 307 FALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN o AAN P 1 1 YN FL S G 363 

I I : I I I I I I : I I I I I I I I I I I III II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 313 FAI C YL P I S I LNVL KRVFGMFTHT EDRET VYAW FT FS HWLVYAN S AAN P 1 1 YN FL S G 369 



RESULT 11 
, OX2R_MOUSE 

ID OX2R_MOUSE STANDARD; PRT; 460 AA. 

AC P58308; Q8BG12; 

DT 16-OCT-200I (Rel. 40, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 05-JUL-2004 (Rel. 44 , Last annotation update) 



DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2). 

GN. Narne=Hcrtr2 ; Synonyms =Mox2r ; 

OS Mus mus cuius (Mouse) . * 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae,*: Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Cerebellum, and Hypothalamus; 

RX MEDLINE=22354683; PubMed=12466851 ; DOI-10 . 1038/nature01266; 

RA Okazaki Y., Furuno M. , Kasukawa T., Adachi J. , Bono H., Kondo S., 

RA Nikaido I., Osato N., Saito R. , Suzuki H., Yamanaka I., Kiyosawa H., 

RA Yagi K., Tomaru Y., Hasegawa Y. , Nogami A., Schonbach C, Gojobori T., 

RA Baldarelli R. , Hill D.P., Bult C, Hume D.A., Quackenbush J., 

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J. A., Bradt D., Brusic V. , Chothia C, Corbani L.E., Cousins S., 

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S., 

RA Gaasterland T., Gariboldi M. , Gissi C, Godzik A., Gough J., 

RA Grimmond S., Gustincich S., Hirokawa N. , Jackson I.J., Jarvis E.D., 

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M. , King B.L.., 

RA Konagaya A., Kurochkin I. V., Lee Y., Lenhard B., Lyons P. A., 

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H., 

RA Nagashima T-, Numata K., Okido T . , Pa van W.J., Pertea G., Pesole G. , 

RA Petrovsky N., Pillai R. , Pontius J.U., Qi D., Ramachandran S., 

RA Ravasi T . , Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M. , 

RA Sandelin A., Schneider C, Semple C.A. , Setou M. , Shimada K., 

RA Sultana R. , Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M. , 

RA Verardo R. , Wagner L. , Wahlestedt C, Wang Y., Watanabe Y. , Wells C, 

RA Wilming L.G., Wynshaw-Boris A. , Yanagisawa M. , Yang. I., Yang L. , 

RA Yuan Z., Zavolan M. , Zhu Y., Zimmer A. , Carninci P., Hayatsu N . , 

RA Hirozane-Kishikawa T., Konno H., Nakamura M. , Sakazume N., Sato K. , 

RA Shiraki T . , Waki K., Kawai J., Aizawa K. , Arakawa T., Fukuda S., 

RA Hara A., Hashizume W., Imotani K. , Ishii Y., Itoh M. , Kagawa I., 

RA Miyazaki A. , Sakai K., Sasaki D., Shibata K., Shinagawa A. , 

RA Yasunishi A., Yoshino M. , Waterston R. , Lander E.S., Rogers J., 

RA Birney E. , Hayashizaki Y. ; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573 (2002) . 

RN [2] 

RP SEQUENCE OF 100-311 FROM N.A. 

RC STRAIN=C57BL/6; 

RA Szendro P.I., Maevers K., Eichele G. ; \ 

RT "Cloning of mouse orexin receptors."; 

RL Submitted (JUN-2001) to the EMBL/ GenBank/DDBJ databases. 

RN [3] .. fj 

RP REVIEW. 

RX MEDLINE=21237974; PubMed-11340621 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E.; 

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [4] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed-112 83317 ; 

RA Willie J.T., Chemelli R.M., Sinton CM. , Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 



RL 
CC 
CC 
CC 
CC 
CC 
CC 

cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



Annu. Rev. Neurosci. 24:429-458(2001). 

--!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 

and orexin-B neuropeptides. 
-!- SUBCELLULAR LOCATION: Integral membrane protein. 
-!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AK038551; BAC30039.1; -. 
EMBL; AK048781; BAC33457.1; -. 
EMBL; AF394597; AAK71327.1; -. 
MGD; MGI: 1889024; Mox2r. 
InterPro; IPR000276; GPCR_Rhodpsn . 
InterPro; IPR000204; Orexin_receptor . 
InterPro; IPR004060; 0rexin_receptor2 . 
Pfam; PF00001; 7tm_l; 1. 
Pfam; PF03827; Orexin_rec2; 1. 
PRINTS; PR00237; GPCRRHODOPSN . 



DR 


PRINTS; PR01522; OREXIN2R. 




DR 


PROSITE; 


PS00237; 


G_PROTEIN 


RECEPJF1 1; 1. 


DR 


PROSITE; 


PS50262; 


G__PROTEIN_ 


*RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Glycoprotein; Transmembrane. 


FT 


DOMAIN 


1 


54 


Extracellular (Potential) . 


FT 


TRANSMEM 


55 


75 


1 (Potential) . 


FT 


DOMAIN 


76 


88 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


89 


110 


2 (Potential) . 


FT 


DOMAIN 


111 


127 


Extracellular (Potential) . 


FT 


TRANSMEM 


128 


150 


3 (Potential). 


FT 


DOMAIN 


151 


172 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


173 


193 


4 (Potential) . 


FT 


DOMAIN 


194 


224 


Extracellular (Potential) . 


FT 


TRANSMEM 


225 


247 


5 (Potential) . 


FT 


DOMAIN 


248 


304 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


305 


327 


6 (Potential). 


FT 


DOMAIN 


328 


342 


Extracellular (Potential) . 


FT 


TRANSMEM 


343 


366 


7 (Potential) . 


FT 


DOMAIN 


367 


460 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


14 


14 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


22 


22 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


202 


202 ^ 


N-linked (GlcNAc. . .) (Potential) 


FT 


CONFLICT 


201 


201 


A -> T (in Ref . 2) . 


FT 


CONFLICT 


240 


240 


I -> V (in Ref. 2) . 


SQ 


SEQUENCE 


460 AA; 


52461 MW 


; D62A67C15BA67DCC CRC64; 



Query Match 67.9%; Score 1315.5; DB 1; 

Best Local Similarity 70.3%; Pred. No. 3.4e-82; 
Matches 251; Conservative 4 0; Mismatches 45; 



Length 460; 
Indels 21; 



Gaps 



4; 



Qy 

Db 



17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVT.IAAWAVFWALVGNTLVCLAVWRNH 
"M I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I | 
24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 



75 



83 



Qy 76 HMRTVTNYFIWLSIADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I I I I I I I I I ! I I I I : I I 
Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

QY 136 TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 

M ! I I I II I I I II I 1:1 I Ml:| I I II: I I I I I I : I I I I I : I I : I I I I I : 
144 TLSCIALDRWYAI CHPLMFKSTAKRARNS I WIWI VSCI IMI PQAIVMECS SMLPGLANK 203 

Qy 196 TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I Is: I I I ::lll:ll I I I •* I I I : I I I II :|| 1111111 ||||||:| : 
204 TTLFTVCDEHWGGEWPKMYHICFFLWYMAPLCIjyiILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMVVLLV 306 

Ml: : I : I I I I I I I I : I I : I I I I I I I : I I I I I i I ! 

Db 264 RKWKQ QQ P VS Q P RGS GQQS KARI S AVAAE I KQ I RARRKTARMLMWLLV 312 

Qy 307 FALC YL P I SVLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AANP 1 1 YN FL S G 363 

I I : I I I I I I : I I I I I I I I I I I III III I I I I I I I I I I I I II I I I I I I I I I I 
Db 313 FAI C YLP I S I LN VXKRVFGMFTHT EDRET VYAW FT FS HWLVYAN S AANP 1 1 YN FL S G 369 

RESULT 12 
AAR01328 



ID AAR01328 PRELIMINARY; PRT; 460 AA. 

AC AAR01328; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2b. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-BALB/c; 

RA Chen J. , Randeva U.S.; 

RT "Cloning and Characterization of the Mouse Type-2b Orexin Receptor 

RT Subtype (OX2bR)."; 

RL Submitted (JUL-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY336085; AAR01328.1; 

KW Receptor. 

SQ SEQUENCE 460 AA; 52461 MW; D62A67C15BA67DCC CRC64; 



Query Match > 67.9%; Score 1315.5; DB 2; Length 460;^ 

Best Local Similarity 70.3%; Pred. No. 3.4e-82; 

Matches 251; Conservative 40; Mismatches 45; Indels 21; Gaps 4; 

\ 

QY 17 S RE P S P VP P DYED- E FLRYLWRD YL YP KQ YEWVL I AAYVAVFWALVGNT LVCLAVWRNH 75 

-II I N: II I I I I I I : I I : I I : I I I I I I I | : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 7 6 HMRTVTNYFIWLSLADVXVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I | | : | | | : M I I I I I I I I I I I I : I I 
D b 84 HMRTWNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 



Qy 


136 


TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 
Ml 1 MM MM llhll llhlll M: M M MM M M: | I : | | III: 
TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 


Db 


144 


Qy 


196 


TRLFSLCHERWADDLYPKI YHSCFFI VTYLAPLGLMAMAYFQI FRKLWGRQI PGTTSALV 255 
1 M : : 1 II ::|||:M IMMMMM M Ml IMIIII 111111 = 1 : 
TTLFTVCDEHWGGEVYPKMYHI CFFLVTYMAPLCLMI LAYLQI FRKLWCRQI PGTS SWQ 263 


Db 


204 


Qy 


256 


RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMVVLLV 306 

III: MM 1 1 1 1 1 1 1 - 1 1 • 1 1 1 1 1 1 1 • 1 1 1 1 1 1 II 
RKWKQ QQPVS QPRGSGQQSKARISAVAAEIKQIRARRKTARMLMVVLLV 312 


Db 


264 


Qy 


307 


FALCYLPISVLm^KRVFGMFRQASDREAVYACFTFSHWLVY^SAANPIIYNFLSG 363 
M: M 1 1 1 1 M M 1 M 1 M 1 1 II 1 III 1 1 II 1 II 1 1 1 II 1 1 II II II 1 1 II 
FAICYLPISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVY7\NSAANPIIYNFLSG 369 


Db 


313 



RESULT 13 
AAR11293 



ID AAR11293 PRELIMINARY; PRT; 460 AA. 

AC ' AAR11293; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel . 27, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 

DE Orexin receptor type-2b. 

GN MOXR2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN' [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Chen J. , Randeva H.S.; 

RT "Genomic structure analysis of the Mus musculus orexin type-2 (MOXR2) 

RT gene . " ; 



RL Submitted (JUL-2003) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AY339383; AAR11293.1; JOINED. 

DR EMBL; AY339384; AAR11293.1; JOINED. 

DR EMBL; AY339385; AAR11293.1; JOINED. 

DR EMBL; AY339386; AAR11293.1; JOINED. 

DR EMBL; AY339387; AAR11293.1; JOINED. 

DR EMBL; AY339388; AAR11293.1; JOINED. 

DR EMBL; AY339389; AAR11293.1; JOINED. 

DR EMBL; AY339390; AAR11293.1; -. 

KW : Receptor. < 

SQ SEQUENCE 4 60 AA; 524 61 MW; D62A67C15BA67DCC CRC64; 

Query Match 67.9%; Score 1315.5; DB 2; Length 460; 

Best Local Similarity 70.3%; Pred. No. 3.4e-82; 

Matches 251; Conservative 40; Mismatches 45; Indels 21; Gaps 4; 



Qy 17 S RE P S P VP P D YED- E FL RYLWRD YL Y P KQ YEWVL I AAYVAVFWALVGNT LVC LAVWRNH 75 

:M I I MM I I I I I I I I : I I : I I : I I | | | | | I : I I I I I I : I j I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRT VTN Y FI VNL S LADVLVT AI CL PAS LLVD I T E S WL FGHALC KVI P YLQAVS VS VAVL 135 



Db 



.84 



I I I I I I I I I I I I I I I I I I I I I I I II : I : I I I I I : I I I : I I I I I I I I I I I I I I : I I 

HMRTWNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 



Qy 


136 


Db 


144 


Qy 


196 


JJD 




Qy 


256 


Db 


264 


Qy 


307 


Db 


313 



TLSFIPLDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 
III I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I I : I I III: 
TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

TRLFSLCHERWADDLYPKIYHSCFFIWYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 
I I I : : I I I : : I I I : I I I I I : I I I : I I \ II : I I I I I I I I I I I I I I i : I : 
TTLFTVCDEHWGGEVYPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMVVLLV 306 

III: : I : I I I I I I I I : I I : I I I I I I I : I I I I I I I I 

RKWKQ QQPVS Q P RG S GQQ S KARI S AVAAE I KQ I RARRKT ARMLMWL LV 312 

FALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYANS AANP I 1 YNFLS G 363 
M : I I I I I I : I I I I I I I I I I I III III I II I I I I I I I I I I I I I I I I I I I I I 
FAICYLPISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSG 369 



RESULT 14 
OX2R_HUMAN 

ID OX2R_HUMAN STANDARD; PRT; 444 AA. 

AC 043614; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. .39, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2) . 

GN Name=HCRTR2; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98150861; PubMed=9491897 ; 

RA Sakurai T., Amemiya A., Ishii M. , Matsuzaki I., Chemelli R.M., 

RA Tanaka H., Williams S.C., Richardson J. A. , Kozlowski G.P., Wilson S., 

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S., 

RA McNulty D.E., Liu W.-S., Terrett J. A., Elshourbagy N.A., Bergsma D.J., 

RA Yanagisawa M. ; 

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP SEQUENCE FROM N.A. . >= 

RX MEDLINE=21580342; PubMed=11723285 ; 

RA Olafsdottir B.R., Rye D.B., Scammell T.E., Matheson J.K., 

RA Stefansson K. , Gulcher J.R.; 

RT "Polymorphisms in hypocretin/orexin pathway genes and narcolepsy."; 

RL Neurology 57:1896-1899(2001) . 

RN [3] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=l 134 062 1 ; DOI=10 . 1002/bies . 1058 ; 

RA Hungs M. , Mignot E.; 

RT "Hypocretin/orexin, . sleep and narcolepsy."; 

RL Bioessays 23:397-408 (2001) . 



RN [4] 

RP REVIEW* 

RX MEDLINE=21178476; PubMed=112 83317 ; 

RA Willie J.T., Chemelli R.M. , Sinton CM., Yanagisawa M. ; 

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness . " ; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC : 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC ' or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF041245; AAC39602.1; -. 

DR EMBL; AY062031; AAL47215.1; 

DR Genew; HGNC:4849; HCRTR2 . 

DR MIM; 602393; 

DR. GO; GO: 0005887; C: integral to plasma membrane; TAS. 

DR GO; GO: 0008188; F: neuropeptide receptor activity; TAS. 

DR GO; GO: 0007631; P: feeding behavior; TAS. 

DR GO; GO: 0007218; P : neuropeptide signaling pathway; TAS. 

DR GO; GO: 0007268; P: synaptic transmission; TAS. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR InterPro; IPR000204; Orexin_receptor . 

DR InterPro; IPR004060; Orexin_receptor2 . 

DR Pfam; PF00001; 7tm_JL; 1. 

DR Pfam; PF03827; Orexin_rec2 ; 1. 

DR PRINTS; PRO 0237; GPCRRHODOPSN . 



DR 


PRINTS; PR01522; • 


OREXIN2R. 




DR 


PROSITE; 


PS00237; 


G_PROTEIN_ 


RECEP Fl 1; 1. 


DR 


PROSITE; 


PS50262; 


G_PROTEIN_ 


^RECEP_F1_2; 1. 


KW 


G-protein 


coupled 


receptor; 


Glycoprotein; Transmembrane. 


FT 


DOMAIN 


1 


54 


Extracellular (Potential) . 


FT 


TRANSMEM 


55 


75 


1 (Potential) . 


FT 


DOMAIN 


76 


88 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


89 


110 


2 (Potential) . 


FT 


DOMAIN 


111 


127 


Extracellular (Potential) . 


FT 


TRANSMEM 


128 


150 


3 (Potential) . 


FT 


DOMAIN 


151 


172 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


173 


193 


4 (Potential) . 


FT 


DOMAIN 


194 


224 


Extracellular (Potential) . 


FT 


TRANSMEM 


225 


247 


5 (Potential) . 


FT 


DOMAIN 


248 


304 


Cytoplasmic (Potential) . 


FT 


TRANSMEM 


305 


327 


6 (Potential) . 


FT 


DOMAIN 


328 


342 


Extracellular (Potential) . 


FT 


TRANSMEM 


343 


366 


7 (Potential) . 


FT 


DOMAIN 


367 


444 


Cytoplasmic (Potential) . 


FT 


CARBOHYD 


14 


14 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


22 


22 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


202 


202 


N-linked (GlcNAc. . .) (Potential) 



SQ SEQUENCE 444 AA; 50680 MW; CA0669F0D4224C65 CRC64 ; 



Query Match 67.8%; Score 1313.5; DB 1; Length 444; 

Best Local Similarity 70.0%; Pred. No. 4.6e-82; 

Matches 250; Conservative 37; Mismatches 49; Indels 21; Gaps 3; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVXIAAWAVFWALVGN^ 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRT VTN Y FI VN L S LAD VL VT AI C L PAS L LVD I T E S W L FGHAL C KVI P YLQAVS VS VAVL 135 

I I I I I I I I I I I I I I I I I I I I I I II I : I : I I I I I : I II : I I I I I I I I I I I I I I : I I 
Db 84 HMRTVTNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIPLDRWYAICHPLLFKSTARR7VRGSILGIWAVSLAIMVPQAAVMQSSSVLPELANR 195 

III I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : I : I I III: 
Db 144 TLSCIALDRWYAICHPIMFKSTAKRARNSIVi™^ 203 

Qy 196 TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : : I Ml : : I I I : I I I I I : I I I .: I I I II : I I I I I I I I I I I I I I I : I : 
Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAEVKQMRARRKTAKMLMWLLV 306 

Ml I I I I I I I I : I I : I I I I I I I : I I I I I I I I 

Db 264 RKWK PLQPVSQPRGPGQPTKSRMSAVAAEI KQI RARRKTARMLMWLLV 312 

Qy 307 FALC YLPI S VLNVLKRVFGMFRQAS DREAVYACFT FS HWLVYANS AANP 1 1 YN FLS G 363 

I I : I I I I I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I 
Db 313 FAICYLPISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSG 369 



RESULT 15 
AAG28021 



ID AAG28021 PRELIMINARY; PRT; 444 AA. 

AC AAG28021; 

DT 02-MAR-2004 (TrEMBLrel. 27, Created) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last sequence update) 

DT 02-MAR-2004 (TrEMBLrel. 27, Last annotation update) 

DE Hypocretin receptor-2 . 

GN HCRTR2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20429525; PubMed-1 09733 18 ; 

RA Peyron C, Faraco J., Rogers W., Ripley B., Overeem S., Charnay Y. , 

RA Nevsimalova S., Aldrich M. , Reynolds D., Albin R., Li R. , Hungs M. , 

RA Pedrazzoli M. , Padigaru M. , Kucherlapati M. , Fan J., Maki R. , 

RA Lammers G.J., Bouras C, Kucherlapati R. , Nishino S., Mignot E.; 

RT "A mutation in a case of early onset narcolepsy and a generalized 

RT absence of hypocretin peptides in human narcoleptic brains."; 

RL Nat. Med. 6:991-997(2000). 

DR EMBL; AF202091; AAG28021.1; 

DR EMBL; AF202085; AAG28021.1; JOINED. 

DR EMBL; AF202086; AAG28021.1; JOINED. 



DR EMBL; AF202087; AAG28021.1; JOINED. 

DR EMBL; AF20208 8; AAG28021.1; JOINED. 

DR EMBL; AF202089; AAG28021.1; JOINED. 

DR EMBL; AF202090; AAG28021.1; JOINED. 

KW Receptor. 

SQ SEQUENCE 444 AA; 50680 MW; CA0669F0D4224C65 CRC64; 

Query Match 67.8%; Score 1313.5; DB 2; Length 444; 

Best Local Similarity 70.0%; Pred. No. 4.6e-82; 

Matches 250; Conservative 37; Mismatches 49; Indels 21; Gaps 3; 

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWLIAAWAVFWALVGNTLVCIAVWRNH 75 

: : I I I I I : I I I I I M I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIWLSLADVLWAICLPASLLVT^ 135 

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I I I I I I I I I I I I I I : I I 
Db 84 HMRTVTNYFIWLSLADVTjVTITCL 143 

Qy 136 TLSFIPLDRWYAICHPLLFKSTAi^RARGSILGIWAVSIAIMVPQA 195 

III I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I : hi I III: 
Db 144 TLSCIALDRWYAI CHPLMFKSTAKRARNSIVI IWIVSCI IMI PQAIVMECSTVFPGLANK 203 

Qy 196 TRLFSLCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : : I III : : I I I : I I 111:111:111 II : I I I I I I I I I I I I I I I : I .: 
Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRG RAFLAE VKQMRARRKT AKMLMWL L V 306 

III I I I I I I I I : I I : I I I I I I I : I I I I I I I I 

Db 264 RKWK P LQ P VS QPRGPGQPTKS RMS AVAAE I KQ I RARRKT ARMLMWL L V 312 

Qy 307 FALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN FL S G 363 

I I : I I I I I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I 

Db 313 FAI C YLP I S I LNVLKRVFGMFAHT EDRETVYAWFT FS HWLVYAN S AAN P 1 1 YNFLS G 369 



Search completed: October 14, 2004, 10:55:49 
Job time : 98.8388 sees 



